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PROCESS FOR PARTITIONING OF PROTEINS 

Field of the invention 

The present invention relates to isolation and purification of proteins in aqueous two-phase 
systems (ATPS). Specifically the invention provides processes for partitioning of molecules 
of interest in ATPS by fusing said molecules to targeting proteins which have the ability of 
carrying said molecule into one of the phases. 

Background of the invention 

Liquid-liquid extraction in an aqueous two-phase system (ATPS) can offer a powerful 
technique for isolation and purification of proteins. The separation of macromolecules and 
particles by means of liquid-liquid extraction is well known ( Albert sson, 1 986; Walter et al , 
1985; Kula, 1990). Mainly polyethylene glycol (PEG) - salt, PEG-dextran and PEG-starch 
systems have been in use. More recently detergents and detergents with reversed solubility 
were discovered as suitable methods for separation of macromolecules, and especially for 
the separation of proteins. 

An advantage of aqueous two-phase systems (ATPS) is that they are especially suited for 
large scale processing of microbial proteins not only from culture supernatants but also from 
crude extracts containing cells and cell debris (Kula, 1979; Kula, 1985). Characteristic 
features of biological fluids as well as suspensions are rather small particle sizes, low density 
differences between fluid and suspended solids, high viscosities of the extracts and high 
compressibility of the solids (Hustedt et a/., 1985; Bender and Koglin, 1986). These 
attributes decrease the performance of conventional methods for solid-liquid separation like 
centrifugation and filtration at the beginning of a protein recovery process. Using an 
aqueous two-phase system removal of solids can be integrated into a liquid-liquid separation 
step, clarification is thus combined with an initial purification (Kula, 1979; Kula, 1985). 

After the extraction process phase separation can be accomplished by settling under gravity 
as well as by centrifugation (Kula, 1985). ATPS can be applied in various scales from very 



t \ 
t 

WO 00/58342 PCT/FIOO/00249 

2 

small laboratory scale to large industrial scale thus suiting for various proteins, purposes and 
needs. With regard to industrial purposes commercially available centrifugal separators can 
be used to shorten separation time. Several authors have investigated the potential of 
centrifugal separators of various design for processing of large volumes of aqueous two 
5 phase systems (Kula, 1979; Kula et al., 1981, Kula et a/., 1982; Kula, 1985). In these 
studies the authors have used polymer/polymer or polymer/salt systems and the results of 
these investigations demonstrate the feasibility of continuous separation of aqueous two- 
phase systems in centrifugal separators. 

Extraction systems based on nonionic surfactants have been described as an alternative to 
standard polymer/polymer or polymer/salt systems. Phase forming surfactants are e.g. 
polyoxyethylene type nonionic detergents. The basis of this type of aqueous two-phase 
system is the temperature-dependent reversible hydration of the polar ethylene oxide head 
groups. The temperature at which the phase separation occurs is referred to as the cloud- 
point (cloud-point extraction). This kind of aqueous two-phase system is especially suited 
for the extraction of amphiphilic biomolecules. The potential of this type of two-phase 
system for separating membrane bound proteins from cytosolic and peripheral membrane 
proteins was first demonstrated by Bordier (1 98 1). Heusch and Kopp (1988) have been able 
to demonstrate that lamellar structures formed in the miscibility gaps of polyglycol ether / 
water systems are responsible for the selective extraction of hydrophobic substances. 

Recently, the successful application of a surfactant-based aqueous two-phase system for the 
extraction of a membrane bound protein (cholesterol oxidase) from the unclarified culture 
medium of the gram-positive microorganism Nocardia rhodochroas on a bench scale has 
25 been reported (Minuth et al., 1 995). By addition of only one chemical compound a product 
release through solubilization was possible in homogeneous phase and in a second step a 
clarification as well as an initial purification was achieved by an extraction process at 
elevated temperatures separating the detergent rich phase. A closed concept was further 
developed for the production of the membrane bound enzyme by surfactant-based 
30 extraction, organic solvent extraction and anion-exchange chromatography, which gave a 
product suitable for analytical applications (Minuth et al., 1996). 
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In aqueous two phase systems the desired target e.g. a protein should partition selectively 
into one phase (preferentially the lighter phase) while the other substances should partition 
into the other phase (preferentially the heavier phase). In PEG/salt and PEG/dextran and 
similar systems there are several driving forces for a substance like charges, hydrophobic, 
hydrophilic forces or the dependence on conformation or ligand interaction (Albertsson, 
1986). The forces leading to separation in detergent based aqueous two phase systems are 
suggested to be primarily hydrophobic (Terstappen et al y 1993). Even if a lot of work has 
been carried out in the field of prediction in ATPS, none of the designed models provides 
a physical picture of the phase behaviour and prediction is hardly possible (Johansson etal., 
1998). 

In ATPS the partitioning coefficient is defined as the concentration (activity in case of an 
enzyme) of the target in the top phase divided by the concentration (enzyme: activity) of the 
target protein in the bottom phase. Partitioning coefficients in ATPS systems are usually in 
the range from less than 1 up to less than 100 (Terstappen et al. y 1992; Terstappen et a!., 
1993). 

C iJB 

Yield: is defined as the amount of target in the top phase divided by the sum of the amount 
of target in top and bottom. This leads to the following equation 



1 + 



If the desired substance is directed to the heavier phase (as it can be the case using Triton) 
the yield is defined by 
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The volume ratio of the two coexisting phases are defined by the volumes of the lighter over 
the heavier phase, respectively. 




An example of useful proteins facing problems in purification in a cost-effective way are the 
commonly used industrial enzymes used as biocatalysts, the glycosyl hydrolases, proteases 
and lipases produced by fungi and bacteria. These are used in e.g. laundry, textile, paper and 
pulp, food and feed industry. The fact that microbes produce many different enzymes during 
their growth and the fact that some of these may be undesired in certain applications leads 
to a need to enrich the active component(s). This enrichment can be performed by choosing 
appropriate growth conditions, by genetic engineering and/or by down-stream processing 
(e.g. purification of the active component(s)). 

Purification of proteins are generally performed by chromatography. Usually gel- 
chromatographic methods are used based on ion-exchange, hydrophobic interaction, affinity 
chromatography and molecular sieving. Methods like electrophoresis and crystallisation can 
also be used. These methods are well known in the art and suitable for proteins of fairly high 
market value. In case of bulky enzyme production these methods, however, are too 
expensive in order to keep the final product on a compatible price level. Due to similar 
properties of these enzymes several purification steps are usually needed to separate the 
proteins from each other. This often causes low final yields and therefore a high loss of 
product. 

Many extracellular hydrolases produced by the filamentous fungus Trichoderma are 
currently used in different industrial applications in large scale. These hydrolases are e.g. 
hemicellulases (such as xylanases and mannanases), cellulases (such as endoglucanases and 
cellobiohydrolases) and proteases. Purification of these is well known in the art (Bhikhabhai 
et al., 1984; Pere eta/., 1995), but for large industrial applications the purification methods 
are too expensive. Alternative methods to enrich these hydrolases have been used, including 
deletion of undesired genes by genetic engineering (Suominen et al. 9 1 992). However, even 
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after extensive genetic engineering some minor undesired activities may still be present in 
the final product. 

ATPS have been studied in purification of cellulases of T. reesei and the purification of an 
endoglucanase III showed some promising results, enriching the yield of the protein in the 
upper phase (U.S. Pat 5,139,943). ATPS have also been studied in purification of lipases, 
endoxylanase and natamycin (EP 0 574 050 Al). No K and Y values are, however, 
mentioned. 

As in other protein purification methods, similar properties of proteins produced by an 
organism impair also in ATPS, e.g. selective separation of one protein is not achieved 
optimally. To obtain selectivity in purification affinity chomatographic methods are used 
especially for analytic purposes and in purification of high-value products. These include 
immunoaffinity chromatography and various fusion protein strategies well described in the 
art such as fusing the protein of interest to an other protein (e.g. glutathione-S-transferase), 
protein domain (e.g. protein A-ZZ domain) or small peptide (e.g. His-tag), which selectively 
bind to the solid carrier and thus the recovery of the fusion partner is obtained as well. The 
fusion protein can be suitable for the particular purpose as such or cleavage of the product 
from the added fusion partner may be desired. There are well-known methods in the art on 
cleavage of fusion proteins from their partners by proteases, e.g. by factor X, thrombin or 
papain or by genetically introducing a protease cleavage site (e.g. Kex2 site) or autoproses- 
sing domains (e.g. Intein, New England Biolabs) or by chemical cleavage (e.g. CNBr). 

ATPS offer advantages mainly with respect to technology compared with the solid state 
based separation systems e.g. affinity column-based techniques. The scale-up of extractive 
enzymes is relatively simple utilising commercially available equipment and machinery 
common in the chemical industry. In addition, it can be used in a continuous process and it 
can be relatively cost extensive. It can be used as a single step for clarification, concentration 
and purification. ATPS can be used as a first capture-step, but for bulk products often no 
further purification is needed. 
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To aid selective separation in two-phase systems, recent publications have described the 
fusion of small peptide tags of 12 amino acids to the protein to be purified. The most 
successful of these soluble peptides are containing tryptophans. So far they have mainly been 
applied for very small molecules like the staphylococcal protein A derivative ZZTO 
5 (Berggren et al. 9 1999; Hassinen et a/., 1994; Kohler et al, 1991). 

Use of ATPS has so far been limited to certain targets. Due to the advantages of ATPS in 
protein separation, purification and localisation, highly selective and powerful methods 
should be developed. This is especially important for large scale processes where ATPS in 
10 general is very inexpensive as a first capture step or as the only step for purification, 
clarification and concentration.The system should be universal so that the technique would 
be strong enough to mediate separation of in principle any component to the desired phase 
irrespective of its size or biochemical properties. 

15 Description of the invention 

Tn this invention we describe selective separation and partitioning of molecules and particles 
fusing them with targeting proteins having the capability to carry the molecule or particle 
of interest to the desired phase in ATPS, and to keep it in this phase if wanted. This 

20 invention is directed to make ATPS usable for every biotechnological product. By addition 
of the targeting protein to selective products, either by genetic tagging of proteins, by 
chemical binding, glueing or by use of any other technique, the product molecule can be 
turned more suitable for separation in ATPS. Using ATPS the product or certain component 
is therefore driven to one phase while the other components or by-products are directed to 

25 the other phase(s). 

We also describe that efficient separation in ATPS can be obtained using targeting proteins 
which are/can be larger than the described small soluble synthetic peptide tags of 1 2 amino 
acids or less. These targeting molecules can aid in separating of small molecules but even 
30 large proteins and particles. Unlike the small peptide tags, it is not necessary that they 
contain tryptophan residues, although they may do so. They can be hydrophobic or 
moderately hydrophobic and/or amphipathic in nature, either in monomeric form or when 
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forming aggregates. Such proteins can be found in nature or they can be designed, or 
obtained through for instance methods known in art for mutant generation, gene shuffling 
or directed evolution. Suitable targeting molecules can be screened for instance by fusion 
the product of interest to a library of natural or mutant sequences, and screening the ability 
5 of the fusion molecules to separate in ATPS. Furthermore, any molecule capable of 
separating in ATPS is a suitable targeting molecule. 

Several techniques exploiting purified protein for isolation of the corresponding gene may 
be used to find genes encoding suitable targeting molecules for ATPS. Suitable proteins or 

10 polypeptides may be purified on the basis of their properties. They can be obtained by 
applying the cells, cell extracts or culture media to ATPS and recovering the proteins or 
peptides separated into the phase containing the hydrophobic phase material Suitable 
targeting molecules may also be recovered for example from the culture medium foam 
formed either during the cultivation of a microorganism or caused by bubbling gas through 

15 the medium. Proteins and peptides suitable as targeting molecules may further be recovered 
from aggregates caused by freezing of culture media. After the targeting molecules have 
been purified, the corresponding genes are isolated using techniques known to a person 
skilled in the art. Such techniques include for example screening of expression libraries using 
antibodies raised against purified polypeptide or peptide, and PCR cloning and screening of 

20 genomic and/or cDNA libraries with oligonucleotides designed on the basis of N-terminal 
or internal protein sequences. 

Examples of molecules suited as targeting proteins in ATPS found in nature are 
hydrophobin-like small proteins. Hydrophobic are secreted proteins with interesting 

25 physico-chemical properties that have recently been discoverd from filamentous fungi 
(Wessels, 1994; Wosten and Wessels, 1 997; Kershaw and Talbot, 1998). They are usually 
small proteins, approximately 70 to 1 60 amino acids, containing eight cysteine residues in 
conserved pattern, and do usually not contain tryptophanes. However, also multimodular 
proteins with one or several hydrophobin domains and e.g. proline-rich or asparagine/glycine 

30 repeats, or hydrophobins containing less than eight cysteine residues have been characterized 
(Lorae/a/., 1994; Lora etal, 1995; Arntz and Tudzynski, 1997). Hydrophobins have been 
divided into two classes based on their hydropathy profiles (Wessels, 1994). 
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Today most protein data exists for the hydrophobins Sc3 p of Schizophyllum commune (class 
I), and cerato-ulmin of Ophiostoma ulmi and cryparin of Cryponectria parasitica (class II), 
although more than 30 gene sequences for hydrophobins have been published (Wosten and 

5 Wessels, 1997). HFB genes are often naturally highly expressed but due to special 
requirements in cultivation conditions and the biochemical properties of the proteins, 
purification of HFBs in large amounts have been difficult. For instance only relatively 
moderate production levels of a few mg per liter of Sc3 hydrophobin in static cultures are 
obtained (Han Wosten, personal communication). Published purification procedures include 

10 e.g. multi-step extraction from fungal cell walls using organic solvents and bubbling or 
freezing of culture filtrates (Wessels, 1994). No reports of successful production of 
hydrophobins are available; levels of cerato-ulmin were no higher than those obtained with 
other naturally occuring fungal isolates (Temple et al., 1997). 

15 Upon shaking hydrophobin-containing solutions, the protein monomers form rodlet-like 
aggregates. These structures are similar to the ones found on surfaces of aerial stuctures. 
The self-assembly of purified Sc3 hydrophobin into a 10 nm thick amphipatic layer on 
hydrophilic and hydrophobic surfaces has been demonstrated (Wosten et al. , 1 994a; Wosten 
et al , 1 994b). This film is very strongly attached to the surface and not broken, for instance, 

20 by hot detergent. The hydrophobic side of the layer on hydrophilic surfaces shows properties 
similar to teflon (Wessels, 1994). The Sc3 assemblages, as well as those of cerato-ulmin and 
cryparin, also form on gas-liquid or gas-air interphases thus stabilizing air bubbles or oil 
droplets in water 

25 Surface activity of proteins is generally low but hydrophobins belong to surface-active 
molecules, their surfactant capacity being at least similar to traditional biosurfactants such 
as glycolipids, lipopetides/lipoproteins, phospholipids, neutral lipids and fatty acids (Wosten 
and Wessels, 1997). In fact Sc3 hydrophobin is the most potent biosurfactant known. It 
lowers the water surface tension to 24 mJm 2 at a concentration of 50 ^g/ml due to a 

30 conformational change during self-assembly of monomers into an amphipathic film (Wosten 
and Wessels, 1997). 
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Hydrophobin-like molecules vary in their properties. For instance, rodlet-forming capacity 
has not been assigned for all hydrophobins (such as some class II), or they might have a 
weaker tendency to form stable aggregates (Russo et al. 7 1992; Carpenter et al, 1992). 
Another group of fungal amphiphatic proteins are repellents (Wosten et al. y 1 996 (Ustilago), 
for review, see Kershaw and Talbot, 1998). Consequently, other type of proteins suited as 
targeting proteins for ATPS, may have only some of the features assigned to hydrophobins. 
Other suitable proteins are hydrophobic ones such as e.g. lipases, cholesterol oxidase, 
membrane proteins, small peptide drugs like nisin, aggregating cell wall proteins, lipopetides 
or any parts of these or combinations of these, and other molecules like glycolipids, 
phospholipids, neutral lipids, fatty acids in combination with proteins or peptides. 

In this invention the targeting protein, such as a hydrophobin-like protein or parts of it, is 
bound to the product molecule or the component to be separated. First, phase forming 
materials and eventually possibly also additional salts are added to a watery solution 
containing the fusion molecule or component, and optionally also the contaminating 
materials. The added agents are mixed to facilitate their solubilization. As soon as they are 
solubilized the two phases are formed either by gravity settling or centrifugation. In the 
separation the target protein drives the product to for instance the detergent-rich phase 
which could either be the top or the bottom phase. The method is not only useful for 
purification of products of interest but also for keeping the product or the component of 
interest, such as a biocatalyst, in a particular phase which enables certain useful biotechnical 
reactions. 

Several ATPS systems are suitable for performing this invention. These include PEG 
containing systems, detergent based systems and novel thermoseparating polymers. 
Detergent based systems can be nonionic, zwitterionic, anionic or kationic. The system can 
be based on amphiphilic polymeric detergents, micelle forming polymers. Novel polymers 
can be based on polyethylene-polypropylene copolymers such as pluronic block copolymers, 
Brij, polyoxyethylene derivatives of partial ethers of fatty acids made by adding polyoxyet- 
hylene chains to the nonesterified body and polyoxyethylene derivatives. The well known 
PEG/salt, PEG/dextran and PEG/starch (or derivatives such as Reppal, hydroxipropyl 
starch) systems where PEG and water are forming the top phase and dextran/starch/salt and 
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water are forming the bottom phase. As salts are used phosphate, citrate, sulfate or others. 
In the present process the target is partitioning mainly to the top phase, while most of the 
contaminants are separating mainly to the bottom phase. Some hydrophobic contaminants 
might partition to the top phase as well. Using detergent based systems only one phase 

5 forming detergent has to be added. Optionally, salts and other chemicals can be used in 
addition. The mentioned chemicals are added, and the solution is mixed. After mixing the 
separation can take place either by centrifugation or gravity settling. In order to separate 
into two phases the temperature of the solution has to be over the cloud-point of the 
detergent. The solution has to be heated if the cloud-point is not reached otherwise. If 

10 wanted, a second separation step can follow after a first extraction step and the product rich 
phase can be further purified. Also the remaining product in the product poor and by- 
product rich phase can be re-extracted. Very good K values can be obtained and the yields 
and concentration factors are high. 

15 The process of the present invention can advantageously be used in laboratory scale but is 
especially suitable for large scale separations. It can successfully be used in the separation 
of proteins and components from large fermentations. Using genetic modifications, the 
method can be used to purify any protein of interest including extracellular enzymes and 
proteins such as cellulases and hemicellulases from mixtures containing large amounts of 
20 protein such as several grams per liter. Furthermore, this separation can be obtained from 
various culture media including industrial media containing particular materials such as 
cellulose and spent grain. The method can be used to purify the product from culture media 
of strains modified not to produce endogenous hydrophobins. The separation can be done 
directly from the fermentation broth which can additionally contain cells, even viscous 
25 filamentous fungi. High biomass levels are acceptable for the process as explained in 
example 9. An example is the extracellular endoglucanase I from the fungus Trichoderma 
reesei which can be tagged for instance with the class 2 HFBI and can for example be 
separated with the nonionic polyoxyethylene C12-C18E05. In this example the detergent 
rich phase is the lighter phase and contains most of the tagged endoglucanase while most 
30 of the other cellulases, proteases and other enzymes remain in the heavier phase. The 
mycelium separates to the bottom phase, too. The separation can be achieved using 



WO 00/58342 PCT/FI 00/00249 

11 

separation temperatures higher than 25 °C. The temperature can be decreased if certain salts 
like NaCl or K 2 S0 4 are added 

The invention describes separation of molecules produced in various different organisms 
5 such as bacteria, yeast and filamentous fungi. The invention is suitable for purification of 
product molecules from extra- or intracellular locations, including cell wall bound molecules. 
It provides examples how the fusion molecule can be secreted by these different organisms 
but also provide an example how the fusion can be produced intracellularly. 

10 The invention further describes how fusion molecules consisting of several domains can be 
constructed and successfully expressed and produced. The invention describes fusions of the 
targeting molecule to a small protein (CBD), to a moderately sized protein (EGI) and to a 
huge highly glycosylated protein (Flol), and different domain variations of these. These 
molecules can be ready as such for biotechnical use. Alternatively, the product can be 

15 cleaved from the targeting protein by any method known in the art such as with proteases 
e.g. thrombin, factor X, papain or by chemical cleavage. Furthermore, ATPS is a preferential 
means to be used to separate the product from the targeting protein after cleavage, or these 
can be separated with other methods known in the art. 

20 A suprising feature is that the targeting protein can also be used to carry large particles to 
the desired phase in ATPS. This can be obtained if the particles already contain proteins 
suited for targeting such as spores/conidia do in case of fungi. The targeting protein can also 
be attached to the particles or compounds in vitro. If cells are separated, the targeting 
protein can alternatively be expressed in the recombinant cells in such a way that it is 

25 exposed at the cell surface whereby it mediates the separation of the cells in ATPS. A 
teaching how this can be done is provided in example 22. Other types of molecules which 
direct the targeting molecule to the cell surface can be found e.g. in the literature including 
bacterial outer membrane proteins and lipoproteins (Stahl and Uhlen, 1997), and yeast 
proteins a-agglutinin and flocculin(Schreuder ai, 1996; KlisetaL (1994) WO 94/01567; 

30 Frenken (1994) WO 94/18330). 
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A further advantage of the system is that the invention combined with ATPS provides a 
means to separate the product or desired component not only from other unnecessary or 
unwanted proteins but also from harmful proteins such as proteases as described in example 
6. Thus, the invention is particularly suited for production and purification of heterologous 
5 proteins, e.g. sensitive mammalian proteins usually produced in limited amounts in 
heterologous hosts. Such proteins are for instance antibodies or fragments thereof, 
interferon, interleukin, oxidative enzymes and any foreign protein which can otherwise be 
produced in a host. It is possible that separation of the product from e.g. culture medium 
can also be obtained on-line or semi-continuously, thus minimising the effect of proteases 
10 or other harmful components present in the culture. When produced intracellular^, the 
invention also provides means to separate the heterologous product, for instance the 
inclusion bodies it may form, from the cellular extracts. 

This invention describes for the first time that fusion proteins containing hydrophobin-like 
15 molecules can be made and produced in significant amounts despite the very particular 
properties of hydrophobin-like molecules. Importantly, this invention also describes how 
recombinant strains producing increased amounts of hydrophobin-like proteins as such can 
be made. This provides means to produce the targeting protein for uses in which it is wanted 
that the targeting protein is bound to the product or particle in vitro, to enable further 
20 separation of such molecules or particles in ATPS. 

Importantly, this invention also decsribes how hydrophobin-like molecules can be purified 
in ATPS very efficiently with high K-values. The molecules can be separated in the same 
way as the above mentioned fusions, for instance by PEG systems of by detergent-based 
25 systems. Separation can be done from the culture medium or from cells. This provides a 
significant improvement in making pure preparates containing hydrophobin-like molecules 
since due to their properties their purification is very complicated and results in losses with 
the previously reported techniques as described above. 

30 Brief description of drawings 



Fig. 1 



shows the map of the plasmid pMQ103. 
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shows the map of the plasmid pMQl 13. 
shows the map of the plasmid pMQ 1 04. 
shows the map of the plasmid pMQl 14. 
shows the map of the plasmid pMQ105. 
shows the map of the plasmid pMQl 15. 
shows the map of the plasmid pMQ 121. 

Coomassie-stained 10% SDS-PAGE of the partitioning of EGIcore-HFBl 
fusion protein in two-phase separation using 5 % of the detergent CI 2- 
C18E05. Lane 1, Molecular weight marker; Lane 2, Purified CBHI (4 jig); 
Lane 3, Purified EGI (4 ^ig); Lane 4, 1/10 diluted VTT-D-98691 cellulose- 
based culture filtrate; Lanes 5 and 6, 1/10 diluted bottom phase and detergent 
phase (top phase), respectively, after separation of VTT-D-98691 culture 
filtrate with 5% detergent; Lane 7, Non-diluted bottom phase; Lane 8, Non- 
diluted VTT-D-98691 cellulose culture filtrate. 

Western analysis of the partitioning of EGIcore-HFBI fusion protein in two- 
phase separation by using different concentrations of the detergent CI 2- 
C18EOS. Fusion proteins were detected with anti-HFBI antibodies. Lane 1, 
Molecular weight marker; Lane 2, Purified EGI; Lane 3, VTT-D-98691 
cellulose culture filtrate; Lanes 4 and 5, Detergent phase (top phase) and 
bottom phase, respectively, after separation of VTT-D-98691 culture filtrate 
with 5% detergent; Lane 6, Same as lane 3, except 2 % detergent was used; 
Lane 7, Same as lane 4 ; except 2 % detergent was used; Lane 8, Purified EGI; 
Lane 9:, Purified CBHI. 
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Fig. 10 Coomassie-stained 10% SDS-PAGE showing further purification of EGIcore- 
HFBI fusion protein from the endogenous CBHI when the top phase was re- 
extracted with 2% detergent. Lane 1, Molecular weight marker; Lane 2 y 
Purified CBHI (4 ng); Lane 3, Purified EGI (4 ng); Lane 4, Detergent phase 
5 (top phase) after first extraction; Lane 5, Detergent phase (top phase) after 

second extraction. 

Fig. 11 Coomassie-stained 10% SDS-PAGE analysis of the EGI-HFBI protein when 
treated with thrombin. Lane 1, Molecular weight marker; Lane 2, EGI-HFBI 
10 (1 mg/ml) treated 72 h with 3 U of thrombin at 24°C; Lane 3, Same as lane 2, 

except no thrombin was added; Lane 4, EGI-HFBI (1 mg/ml) treated 48 h with 
9 U of thrombin at 36°C; Lane 5, Same as lane 4, except no thrombin was 
added; Lane 6, Same as lane 5, except no incubation at 36°C. 

15 Fig. 12 shows the map of the plasmid pTNS13. 

Fig. 13 Western analysis of the partitioning of dCBD-HFBI fusion protein in two-phase 
separation using 5 % of the detergent C12-C18E05. Fusion protein was 
detected with anti-HFBl antibody. Lane 1 , Four times concentrated culture 
20 filtrate; Lane 2, Four times concentrated bottom phase; Lane 3, Top phase. 

Fig. 14 shows the map of the plasmid pTNS15. A non-fimctional restriction site is 
indicated with an asterisk. 

25 Fig. 15 shows the map of the plasmid pTNS 1 8. 

Fig. 16 shows the map of the plasmid pTNS23. 

Fig. 17 shows that separation to the detergent phase of cells expressing a hydrophobin 
30 on cell surface occurs: the top detergent phase of S, cerevisiae VTT-C-993 1 5 

is turbid, whereas the top detergent phase of the control strain H2 1 55 is clear. 
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Fig. 18 shows the map of the plasmid pTNS32. 

Fig. 19 shows the map of the plasmid pTH4. 

Fig. 20 shows the map of the plasmid pTNS30. 

Fig. 21 shows the map of the plasmid pTHl . 

Fig. 22 shows the map of the plasmid pTH2. 

Fig. 23 shows the map of the plasmid pKS2. 

Fig. 24 SDS gel showing the purification of HFBI-dCBD in a Berol 532 ATPS system. 

Fig. 25 Western analysis of the partitioning of HFB I-EN A5 S cFv fusion protein in two- 
phase separation using 4% of the detergent C12-C18E05. Lanes from left: 
(1) Molecular weight marker, (2) Top (enriched) phase of the strain VTT-D- 
0079 1 3 (3) Bottom (depleted) phase of the strain VTT-D-00791. 



Fig. 26 Western analysis of the partitioning of SC3 hydrophobin using SC3 specific 
antibodies. Lane 1, Bottom phase of the strain VTT-D-00792, Lane 2, Top 
phase of the strain VTT-D-00792. 



Fig. 27 HPLC analysis for monitoring of the partitioning of HFBI in 2% Berol 532. 
Fig. 28 HPLC analysis for monitoring of the partitioning of HFBII in 2% Berol 532. 



The invention is further illustrated by the following Examples which describe construction 
of the fusion molecules of the invention, and partitioning of the molecules of interest usin^ 
the process according to the invention. 
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EXAMPLES 
Example 1 

Construction of vectors for expression of EGI and EGIcore HFBI fusion proteins 
under the cbhl and gpdl promoters of Trichoderma and gpdA promoter of Aspergillus 

For construction of an EGI-HFBI fusion protein, hfbl (SEQ ID 1) coding region (from Ser- 
23 to the STOP codon) and a peptide linker (Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro 
Gly Gly) preceding it was amplified with PCR using pTNS9 as a template and as a 5 J primer 
TCG GG CACTACG TG C CAG TAT AGC AAC GAC TAC TAC TCG CAA TGC 
CTT GTT CCG CGT GGC TCT AGT TCT GGA ACC GC A (SEQ ID 2) and as a 3 5 primer 
TCG TAC GGATCC TCA AGC ACC GAC GGC GGT (SEQ ID 3). pTNS9 has been 
decribed in detail in Example 19. The sequence in bold in the 5' primer encodes 16 C-terminal 
residues of EGI. The sequence in italics is a thrombin cleavage site and the underlined 
CACTACGTG is a Drain site. The underlined GGATCC in the 3' primer is a Bamffl site. 
The 280 bp PCR fragment was purified from agarose gel and ligated to pGEM-T T/A vector 
(Promega) resulting in pMQ102. 

For construction of an EGIcore-HFBI fusion protein, the hfbl coding region (as above) was 
0 amplified with PCR using pTNS9 as a template and as a 5' primer ACT ACA CGG AG G 
AGC TC G ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG AGC AAC GGC 
AAC GGC (SEQ ID 4) and as a 3* primer SEQ ID 3. The sequence in bold in the 5' primer 
encodes amino acids 410-425 in EGI and the underlined GAGCTC is a SacI site. The 260 bp 
PCR fragment was purified from agarose gel and ligated to pPCRII T/A vector (Invitrogen) 
5 resulting in pMQ 111. 

In the next step Trichoderma expression vectors for production of EGI-HFBI and EGIcore- 
HFBI fusion proteins under the control of cbhl promoter and terminator sequences were 
constructed. The expression vector used as a backbone in the constructs is pPLE3 (Nakari et 
0 al. (1994) WO 94/04673) which contains a pUC18 backbone, and carries the cbhl promoter 
(SEQ ID 5) inserted at the EcoRI site. The cbhl promoter is operably linked to the full length 
egll cDNA (SEQ ID 6) coding sequence and to the cbhl transcriptional terminator (SEQ ID 
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7). The plasmid pMQ102 was digested with Dralll and BaraHI and the released 280 bp 
fragment containing hfbl and linker sequences was purified from agarose gel and ligated to 
pPLE3 digested with Dralll and BamHL The plasmid pMQIll was digested with SacI and 
BamHI and the 260 bp fragment containing the hfbl sequence was ligated to pPLE3 digested 
5 with SacI and BamHI. The resulting plasmids pMQ103 (Figure 1) and pMQl 13 (Figure 2) 
carry the coding sequences for full-length EG1 linked to HFBI via a peptide linker and for 
EGIcore linked to HFBI via its own linker region, respectively, under the control of cbhl 
promoter and terminator sequences. 

1 0 Trichoderma expression vectors for production of EGI-HFBI and EGIcore-HFBI fusion 
proteins under the control of gpdl promoter and terminator sequences of Trichoderma and 
gpdA promoter and trpC terminator sequences of Aspergillus were constructed as follows. A 
SacII site was inserted inbetween the Xbal and Pad sites of pMV4 using as an adapter 
annealed primers TAA CCG CGG T (SEQ ID 8) and CTA GAC CGC GGT TAA T (SEQ 

15 ID 9). The resulting plasmid is pMVQ. pMV4 contains a pNEB193 (New England Biolabs) 
backbone, and carries a 1.2 kb Trichoderma gpdl promoter (SEQ ID 10) and a 1.1 kb gpdl 
terminator (SEQ ID 11) inserted at Sall-Xbal and BamHI-AscI sites, respectively. The 
expression cassettes for EGI-HFBI and EGIcore-HFBI were released from pMQ103 and 
pMQ113 with SacE and BamHI, purified from agarose gel and ligated to pMVQ cut with 

2 0 SacII and BamHI. The resulting plasmids pMQ104 (Figure 3) and pMQl 14 (Figure 4) carry 
the EGI-HFBI and EGIcore-HFBI cassettes, respectively, under the control of Trichoderma 
gpdl transcriptional control sequences. Expression plasmids pMQ105 (Figure 5) and 
pMQ115 (Figure 6) containing EGI-HFBI and EGIcore-HFBI cassettes, respectively, 
operably linked to the gpdA promoter and trpC terminator of Aspergillus were constructed. 

2 5 EGI-HFBI and EGIcore-HFBI cassettes were released from plasmids pMQ 1 04 and pMQ 1 1 4 
with Xbal and BamHI, blunted with T4 DNA polymerase and ligated to Ncol digesteded and 
T4 DNA polymerase treated pAN52-l (SEQ ID 12). pAN52-l contains a pUC18 backbone, 
and carries a 2.3 kb gpdA promoter and a 0.7 kb trpC terminator sequences of A. nidulans. 
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Example 2 

Construction of vectors for over-production of HFBI on cellulase-inducing and - 
repressing media 

5 For over-expression of HFBI under cbhl promoter the protein coding region of hfbl was 
amplified with PCR using as a template pEAlO (Nakari-Setala etal, 1996). pEAlO carries a 
5.8 kb genomic Sail fragment containing hfbl coding and flanking sequences. GTC AA C 
CGCGG A CTG CGC ATC ATG AAG TTC TTC GCC ATC (SEQ ID 13) was used as 
a 5' primer in the PCR and as a 3' primer SEQ ID 3. The sequence in bold in the 5' primer is 

10 21 bp of cbhl promoter adjacent to translational start site of the corresponding gene and the 
underlined CCGCGG is a Kspl site. The obtained fragment of 430 bp was digested with Kspl 
and Bamffl and ligated to pMQ103 digested with Kspl and BamHI. The resulting plasmid 
pMQ121 (Figure 7) carries the coding sequence of hfbl operably linked to cbhl 
transcriptional control sequences. pEAlO plasmid is used for over-production of HFBI in 

1 5 cellulase-repressing conditions. 

Example 3 

Transformation of Trichoderma and purification of the EGI-HFBI and EGIcore-HFBI 
producing and HFBI over-producing clones 



20 



Trichoderma reesei strains QM9414 (VTT-D-74075) and Rut-C30 (VTT-D-86271) ^vere 
co-transformed essentially as described (Penttila et aL, 1987) using 3-13 ug of the plasmids 
pMQ103, pMQ113, pMQ104, pMQ114, pMQ105, pMQ115, pMQ121 and pEAlO and as 
the selection plasmids 1-3 ug P ToC202, P 3SR2 or pAR021. P ToC202 (pUC19 backbone) 

2 5 and p3SR2 (pBR322 backbone) plasmids carry 2.7 kb Xbal and 5 kb EcoRI-Sall genomic 

fragments of A nidulans, respectively, containing the amdS gene (Hynes et aL, 1983; Tilburn 
et aL, 1983). pAR021 is essentially the same as P RLMex30 (Mach et al. 1994) and carries 
the E. coli hph gene operably linked to 730 bp ofpkil promoter and 1 kb of cbhl terminator 
sequences of T. reesei. The Amd+ and Hyg+ transformants obtained were streaked three 

3 0 times onto plates containing acetamide and hygromycin, respectively (Penttila et aL, 1987). 

Thereafter spore suspensions were made from transformants grown on Potato Dextrose agar 
(Difco). 
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The production of the fusion proteins EGI-HFBI and EGIcore-HFBI and HFBI was tested by- 
slot blotting or Western analysis with EGI and HFBI specific antibodies from shake flask or 
microtiter plate cultivations carried out in minimal medium supplemented with either glucose, 
lactose or a mixture of Solka flock cellulose and/or spent grain and/or whey. The spore 
suspensions of the fusion protein producing clones were purified to single spore cultures on 
selection plates (containing either acetamide or hygromycin). To determine the best 
producers, production of the fusion proteins was analyzed again from these purified clones as 
described above. 

T. reesei strains selected for further fermentor cultivations are VTT-D-98692 (pEAlO), VTT- 
D-98492 (pMQ121), VTT-D-98693 (pMQ103), VTT-D-98691 (pMQ113), VTT-D-98681 
(pMQ105) and VTT-D-98682 (pMQ115). These strains have QM9414 as the host strain. 
VTT-D-99702 (pMQl 13) has Rut-C30 as the host strain. 

Example 4 

Cultivation of the EGI-HFBI and EGIcore-HFBI protein producing and HFBI over- 
producing Trichoderma strains 

EGI-HFBI and EGIcore-HFBI fusions were produced under the cbhl promoter in a 1 5-litre 
fermenter using 7\ reesei strains VTT-D-98693 (pMQ103) and VTT-D-98691 (pMQ113), 
respectively. Strains were grown 5 days on minimal medium (Penttila et al. y 1987) containing 
4% Solka flock cellulose (James River Corporation, Berlin, NH) and 2 % spent grain 
(Primalco, Koskenkorva, Finland). EGIcore-HFBI was also produced in fermenter (15 1) 
using the Rut-C30 strain VTT-D-99702 (pMQl 13) with 4 % lactose medium. To induce the 
production of EGI-HFBI and EGIcore-HFBI fusions under Aspergillus gpdA promoter, T. 
reesei strains VTT-D-98681 (pMQ105) and VTT-D-98682 (pMQl 15) were cultivated in 1 5- 
litre fermenter. Strains were grown 3 to 5 days on minimal medium supplemented with 2% 
glucose, 0.2% Peptone, and 0.1% Yeast Extract, and with glucose feed to maintain the 
glucose concentration in the range of 1 to 3% throughout the cultivation. HFBI over- 
producing strain VTT-D-98692 (pEAlO) was grown similarly in 1 5 1 on glucose medium and 
the strain VTT-D-98492 (pQM121) over-producing HFBI under cbhl promoter was 
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M for 7 da ys m ,5*. r— o„ med,um «* s*. ** ^ 

sp e„ grain. The conuo, culbvadons with the host — ^ ^* 

'1,4075) an, Kut-CBO CVTT.D-Bc.a7l), were earned out on med* — . 0 
K ™w ;» lactose and m) glucose similarly as 

Solka flock cellulose and either spent gram or whey, u) lactose ana ) g 

5 described above. 

When proper some 71 reese, transform. — - «* "» «"» — ****** " 
28 -C I abated for 5 to 6 da y s m 50 ,o ,50 mJ volume of 77,cWe™> — 1 — » 

U» - - • — ^ e " her 3% S0,k " fl ° Ck ^ ^ ' % ^ " 
1 o 3-4% glucose with glucose feeding. 

Example 5 

Standard separation assays and analysis 

! S ff no, otherwise ~d the standard ATPS and subsequent anises and —as were 
carried out as explained in this example. 

ta general whole fennentauon broth, supernatant (biomass separated * ****** - 
Ladon, or pudned proteins ,0 buffer were separated In ,0 ml graduated tubea. P rs 
2 0 ZJ. waa added ,n,o the babes and the tubes were then dUed to ,0 mg wrth pre e,„ 
onJLg «ou,d. The amount of detergent in the babe was calcuiated - we,gb, percent of 
" . Lr thorough rmtung ,n an overhead shaker the separation too, piece b y edher 
grav, y s«dmg in . water bath a. constant temperature or b y cendddagadon a, cons an 
Lperadare. The separahon usuah, was perfomteu a. 30-C, the — anrouM 

. n „, (Wv) After separation the volume ratio was noted and samples 
2 5 detergent used was 2-5 /o (w/v). ^ H 

were iaken from the lighter and heavrer phase for analysis. 

Two-phase separadons were ana, y sed q uah,a,,ve, y b y using SDS-PAGE gels followed b y 
vJLuon of dae lirsion protems with Coom»as,e bnlhan, blue R-750 (S, g ma, or W~» 
30 blotdng PolyConal anb-HFB, andbod y were use, in Western an*,s for de ecn n of 
EcUhfL, EGI-HFBI and dCBD-HFBI protems together wrtb alkaline phospha.as 
conjugated and-rabb,. «0 (Bio-Rad) Alkahne phosphatase acdvi, y was detected 
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colorimetricaUy with BCEP (5-bromo-4-chloro-3-indolyl-phosphate) used in conjunction with 
NBT (nitro blue tetrazolium) (Promega). 

Contaminating endogenous EGI, CBHI and EGEQ in the top phase was tested with 
5 appropriate antibodies. Acidic protease activity in the top and bottom phase was also tested 
using the SAP method (Food Chemicals Codex, p. 496-497, 1981), which is based on the 30 
min enzymatic hydrolysis of a hemoglobin substrate. All reactions were performed at pH 4.7 
and 40 °C Unhydrolyzed substrate was precipitated with 14% TCA and removed by 
filtration. The released tyrosine and tryptophan was determined spectrophotometrically. Total 
10 protein concentrations were determined by Non-Interfering Protein Assay™ (Geno 
Technology, Inc). 

EGI activity was detected using 4-methylumbelliferyl-fl-D-cellobioside (MUC) (Sigma M 
6018) as substrate (Van Tilbeurgh H. & Caeyssens M., 1985; Van Tilbeurgh et.al, 1982). 

15 EGI hydrolyses the fl-glycosidic bond and fluorogenic 4-methylumbelIiferone is released, 
which can be measured using a fluorometer equipped with a 360 excitation filter and a 455 
nm emission filter. CBHI also hydrolyses the substrate and it was inhibited by addition of 
cellobiose (C-7252, Sigma). EGI containing liquid was added in an appropriate dilution to a 
buffer containing 50 mM sodium acetate buffer (pH 5), 0.6 mM MUC and 4 6 mM 

2 0 cellobiose. The mixture was heated to 50°C. The reaction was stopped after ten minutes 
using 2% Na 2 C0 3 , pH 10. Purified CBHI was detected using the same assay as for EGI 
without the addition of the inhibitor cellobiose. 

The partition coefficient K was defined as the ratio of the measured concentrations or 
2 5 activities in the top and bottom phase, respectively. 
The Yield Y was defined as follows: 
1 



Y T =- 



1 + 



H 1 

V T K 



30 



where Y T is the Yield of the top phase, V B and V T are the volumes of top and bottom phase, 
respectively. The Yield of the bottom phase can be described accordingly. 
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The mass balances, e.g. recovery of all added protein, were always checked for completeness 
to ensure no artificially high Y.eld (e.g. due to possible inactivation of the protein in the 
bottom phase). The values were usually calculated based on total enzyme activity (EGI wt 
plus the EGI-fusion) and thus the values are underestimated for the separation of the fusion as 
5 demonstrated in Example 16. 

Example 6 

Small scale ATPS separation studies and gel analysis 

10 EGI-HFBI and EGlcore-HFBl fusions produced under the cbhJ promoter in a 15-litre 
fermenter on Solka flock cellulose and spent grain medium as described in Example 4 using T. 
reesei strains VTT-D-98693 (pMQ103) and VTT-D-98691 (pMQ113), respectively, were 
separated in small scale ATPS as described above. 

1 5 The phases from the two phase separations were analysed qualitatively by using SDS-PAGE 
gels followed by visualization of the fusion proteins with Coomassie brilliant blue or Western 
blotting. Coomassie stained SDS-PAGE (10%) is shown in Figure 8. In the lane containing 
the non-extracted culture filtrate three distinct closely migrating bands can be seen (the 
sample was diluted 1/10 with H 2 0). The topmost band is CBHI, the band in the middle is 

20 EGIcore-HFBI fusion and the lower one endogenous EGI. In the samples separated in 
ATPS, only two bands (CBHI and EGI) are seen in the sample from bottom phase and one 
band representing EGIcore-HFBI in the sample obtained from the top phase. 

Western blotting with HFBI antibody showed thick bands for the top phase, whereas for the 

2 5 bottom phase there was only faint band demonstrating that the fusion is separating strongly 

into the detergent top phase. Figure 9 shows the separation of the EGIcore-HFBI fusion 
produced on cellulose media into the top phase. Contaminating endogenous EGI and EGIIl 
in the top phase was tested with appropriate antibodies but no signal was detected. 

3 0 Small amount of endogenous CBHI was found in the upper phase when CBHI antibody was 

used in Western blotting. EGI, EGIIl and proteases were not found in the top phase. Further 
purification from the contaminating CBHI was observed when the top phase was re-extracted 
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with 2 % detergent. The Figure 1 0 shows that the upper phase does not any more contain 
CBHI and pure fusion protein is recovered. 

EGIcore-HFBI was also produced in fermenter (15 1) using the Rut-C30 strain VTT-D- 
5 99702 (pMQ113) with 4 % lactose medium. The separation in ATPS carried out in the 
standard manner gave essentially the same result as the separation from cellulose containing 
medium thus demonstrating that the purification can be carried out from several media 
relevant for large scale industrial use. 



0 Acidic protease activity in the top was only 1/1 5 compared to the bottom phase (table below) 
demonstrating that acidic proteases remain in the bottom phase. 





A(275nm) 


HUrVml 


Bottom phase 1 


0.146 


41.6 


Top phase 2 


0.009 


2.6 



l l/10 diluted bottom phase after separation of VTT-D-98691 culture filtrate with 2% detergent 
5 2 1/100 diluted bottom phase after separation of VTT-D-98691 culture filtrate with 2% detergent 

3 1 HUT = enzyme concentration, which in reaction conditions hydrolyses hemoglobin in 1 min so that 
the absorbance at 275 nm of the formed hydrolysate equals 1 . 10 ug tyrosine/ml 0.006 N HC1 solution. 

These results show that the fusion protein can be purified extremely efficiently and the 
0 resulting preparate is free of other proteins produced by the fungus including proteases. 

Example 7 

Recovery of the native EGI in ATPS after thrombin cleavage 

5 EGI-HFBI protein produced by the strain VTT-D-98693 has a thrombin cleavage site 
(LVPRGS) designed in the linker region between the EGI CBD and HFB1, which would 
enable the recovery of the native EGI after thrombin cleavage. EGI-HFBI fusion protein was 
purified from the culture filtrate (100 ml) of strain VTT-D-98693 grown on 4% Solka flock 
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cellulose and 2 % spent grain as described in Example 4 using the 2-phase separation system 
(5 % detergent). After removal of the bottom phase the detergent phase was extracted by 
isobutanol. The resulting water phase (-19 ml) was divided in eppendorf tubes and the liquid 
was evaporated with speed vac. Remaining lyophilizate was diluted to 50 mM Tris-Cl (pH 8). 
5 To test the efficiency of thrombin cleavage, 9 units of thrombin (Sigma) was incubated > 24 h 
with 1 mg EGI-HFBI fusion protein in 36 °C at pH 8.0. Coomassie stained SDS-PAGE (10 
%) was used for detection. 

Only minor cleavage was observed in 48 h under these conditions (Figure 1 1), possibly due to 
1 0 steric hindrance by O-glycosylation in the linker. 



Example 8 

Separation of low concentrations of EGIcore-HFBI in ATPS 

15 Detergent based aqueous two-phase systems were successfully applied using very low 
concentrations (diluted) of EGIcore-HFBI fusion protein produced with the cbhl promoter in 
T. reesei VTT-D-98691 (pMQ 113) from a 15 liter cultivation carried out on Solka flock 
cellulose with spent grain as described in Example 4. 

2 0 The original protein concentration of the supernatant was 7.0 mg/ml. This supernatant was 
diluted with de-,onised water by a factor of 100 and 1000, respectively. The fusion protein 
could be separated using 2% (w/w) of the detergent C12-C18E05 with partitioning 
coefficients higher than 5. This is shown in the table below together with the experiment with 
non-diluted supernatant. The partitioning coefficients were calculated based on activity 

2 5 measurements for total EGI (wild type and fusion protein together). 





EGIcore-HFBI 
non-diluted supernatant 


dilution 1/100 


dilution 1/1000 


K 


4.1 


5.3 


5.6 


Y [%] 


38 


31 


32 
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Example 9 

Separation of EGIcore-HFBI from fungal biomass containing culture broths 



EGIcore-HFBI from T.reesei strain VTT-D-98691 (pMQ 1 13-2) was cultivated (50 ml in 
5 250 ml shake flasks) on Solka flock cellulose with spent grain as described in Example 4. 
Directly after the cultivation, part of the whole broth was centrifuged at 3000 rpm for 30 min, 
supernatant was spilled out and the centrifuged mycelium was added to the supernatant to 
obtain artificial whole broths containing different amounts of biomass. 

10 Using 5% of C12-C18E05 in a 10 g experiment consisting of up to 50% wet biomass 
(weight of wet biomass divided by the sum of wet biomass and supernatant) could still be 
separated without any difficulties. The Yield remained in between 61 and 64 % and therefore 
it is not significantly different in comparison to the experiment carried out with supernatant 
only (without mycelium) (see table below). The total recovery of the fusion protein is even 

15 higher. This is most probably due to cell attached enzyme extracted in the ATPS increasing 
the total amount of EGI. The partitioning coefficient was calculated based on activity 
measurements for total EGI (wild type and fusion protein together). 





K 


Y[%] 


Supernatant 


5.5 


62 


25% of wet biomass in 
supernatant 


7.3 


66 


40% of wet biomass in 
supernatant 


6.4 


61 


50% of wet biomass in 
supernatant 


7.6 


64 
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Example 10 

Separation of EGI-HFBI in ATPS 

EGI-HFBI from Trichoderma reesei strain VTT-D-98693 (pMQ 103) from a 15 liter 
cultivation carried out on Solka flock cellulose and spent grain as described in Example 4 was 
separated in a 10 g experiment using different amounts of C12-C18E05. The partitioning 
coefficients are shown below. The partitioning coefficient was calculated based on activity 
measurements for total EG1 (wild type and fusion protein together), and as in previous 
examples the endogenous EGI is included in the partitioning coefficients. 



Detergent [% w/w] 


2 


3 


5 


7 


K 


1.9 


1.8 


1.4 


1.1 



>5 



Example 11 

Separation of EGIcore-HFBI in 50 ml 

EGlcore-HFBI from T. reesei strain VTT-D-98691 (pMQ 113) cultivated in 15 liters using 
Solka flock cellulose and spent grain as described in Example 4 was separated in Falcon tubes 
in a 50 g experiment using 5% of C12-C18E05. A partition coefficient of 2.52 and a yield of 
51 % could be obtained. The separation was performed at 30°C at 3000rpm for 30 minutes 
The values are based on activity measurements for total EGI activity (wild type and fusion 
protein together) including endogenous EGI. 

Example 12 

Separation of EGIcore-HFBI in ATPS using different detergents 

EGlcore-HFBI from T. reesei strain VTT-D-98691 (pMQ 1 13) cultivated in 15 liters using 
Solka flock cellulose with spent grain as described in Example 4 was separated in a 10 g 
experiment using 2% of detergent in each experiment. The detergents investigated in this 
example were C10 EOS, C12 EOS, C14 E06 (each Nikko Chemicals, Japan), C12-C1 8 E05 
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CAgnmul NRE 1205", Henkel, Germany), CI 2/1 4 5EO, CI 2/1 4 6EO ( Clariant, Germany), 
C9/11 EOS. 5 („Berrol 266", Akzo Nobel, Germany), Triton X-114 (Sigma, Germany). The 
partition coefficients and yields are listed below. The values are based on activity 
measurements for total EGI activity (wild type and fusion protein together) including 
5 endogenous EGI. 





K 


Y(fosion) [%] 


C10EO5 


20 


56 


C12E05 


15 


57 


C12-C18E05 


14 


66 


C12/14 5EO 


12 


58 


C12/14 6EO 


14 


62 


C14E06 


11 


54 


C9/11 E05.5 


5 


30 


Triton X- 114 


0.16 


53 



0 Example 13 

Separation of EGIcore-HFBI in ATPS from glucose grown cultures 

EGIcore-HFBI was separated from cultivation of the Trichodemia reesei strain VTT-D- 
98682 (pMQ115) cultivated with glucose as described in Example 4. The supernatant was 
5 separated with 2% of the detergent C12-C18 EOS. The fusion protein could be partitioned 
with a K value of 2.4. In comparison, the K value for the native EGI is 0.3 when measured in 
a similar way for purified EGI. 

Example 14 

0 Separation of EGIcore-HFBI using different concentrations of detergent 

EGIcore-HFBI from T. reesei VTT-D-98691 (pMQ 113) cultivated in 15 litres using Solka 
flock with spent grain as described in Example 4 was separated in detergent based ATPS 
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applying different amounts of the detergent C12-C18 EOS on the cell free supernatant. The 
partitioning coefficients are shown in the table below. The corresponding gel electrophoresis 
and Western antibody-blots are shown in Figure 8 and Figure 9, respectively. 
The values are based on activity measurements of total EGI activity. 



Amount of detergent C12-C18 E05 


K 


Yield(%) 


1.0% 


6.1 


9 


2.0% 


4.1 


38 


3.5% 


3.6 


50 


5.0% 


2.9 


55 


7.5% 


1.7 


53 


10.0% 


1.1 


58 



10 



Example 15 

Re-extraction of the detergent phase 



Detergent based ATPS was applied on EGlcore-HFBI fusion protein containing supernatant 
produced by the strain VTT-D-98691 (pMQ 113) in a shake flask cultivation. The first 
extraction using C12-C18E05 conducted under the standard conditions shows a partitioning 
coefficient of 16 and a yield of 72 % ( wild type EGI measured together with fusion protein) 

15 The top phase was re-extracted in 10 mM sodium acetate buffer (pH 5) with 2% of 
detergent. A partitioning coefficient of 52 and a yield of 89 % could be obtained. In the re- 
extraction experiment of the bottom phase (2% of detergent), a small yield of 7.5% and a K 
of 0.8 of EGI activity were achieved The partitioning coefficients were calculated based on 
activity measurements for total EGI (wild type and fusion protein together). Due to the wild 

2 0 type EGI present in the sample, the yield is at least 72% and the partitioning coefficient at 
least 16 in the first extraction. The SDS-PAGE results of both extraxtions are shown in 
Figure 10. 
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Separation step 


K 


Y[%] 


2% detergent 


16 


72 


reextraction top phase 


52 


89 


reextraction bottom phase 


0.8 


7.5 



Example 16 

Separation of pure cellulases in ATPS 

5 

The effect of HFBI on partitioning and the final yield can further be demonstrated by 
comparing the extraction result of EGIcore-HFB fusion with extraction results obtained with 
purified wild type EGI and EGIcore. The fusion protein is partitioning more than 100 times 
better to the detergent phase (see table belove). 

0 

The improvement on the partitioning of the purified fusion protein from the first extraction 
obtained in the re-extraction (see Example 15) can be explaned by the partitioning of the wild 
type EGI as demonstrated with purified wild type EGI in the table below. The wild type EGI 
lowers the partitioning coefficient in the first extraction (since EGI activity is measured from 
5 both top and botom phase), but the absence of it in the re-extraction increases the partitioning 
coefficient of the EGIcore-HFBI fusion. The purity can in addition be demonstrated by 
analysing the partitioning of pure CBHI, which is the major contaminating protein 
corresponding to about 50 % of all secreted T. reesei proteins. Pure CBHI has a partitioning 
coefficient of 0.5 and a yield of 3 .6 and is therefore separated from the fusion protein. 

0 



Separation step 


K 


Y[%] 


re-extraction of top phase 


52 


89 


extraction of pure wild type EGI 


0.3 


2.2 


extraction of pure EGI-core 


0.3 


2.3 


extraction of pure CBHI 


0.5 


3.6 



WO 00/58342 



PCT/FI 00/00249 



30 



Using the definitions of K and Y and calculating mass balances, the ratio of the amount of 
EGI fusion protein to EGI wild type can be calculated. The "true" partition coefficients and 
Yields can be concluded from this. "True" means the values which would be detected if it 
would be possible to measure only the amount of EGI-fusion without measuring the amount 
5 for EGI wild type at the same time. 

The fundament for the calculation is the re-extraction experiment. The re-extracted top phase 
is believed to be pure. An example of the measured values and the calculated "true" values 
based on this are shown in the table below for two cultivations of VTT-D-98691 (pMQl 13) 
1 0 grown as described in Example 4. 



cultivation 
vessel 


cultivation 
substrate 


K "with EGI 
wt" 


"true" 
K 


Y [%] "with 
EGI 

wt" 


"true" 
Y[%] 


15 liter 
fermenter 


whey permeate 


4 


6 


16 


54 


250 ml shake 
flask 


cellulose 


16 


54 


66 


90 



1 5 Example 17 

HFBI and HFBII purification in ATPS 

HFBI was produced by cultivating the T. reesei strain VTT-D-98692 (pEA10-103B) using 
glucose as substrate as described in Example 4. HFBI could be separated using 2% of the 
2 0 detergent C12-C18 E05 with a partition coefficient higher than 20 under the standard 
conditions described. 



HFBII was produced by cultivating the T. reesei strain VTT-D-74075 (QM9414) on whey 
spent grain as described in Example 4. HFBII could be separated using 2% of the detergent 
2 5 C12-C 1 8 EQ5, exceeding a partition coefficient of 1 0 under the standard conditions. 



Both HFBI and HFBII hydrophobins are thus partitioning well to the upper phase i 
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Example 18 

Detergent based ATPS with additional NaCl 

EGIcore-HFBI from which cultivation of T.reesei was separated in a 10 g experiment using 
5 5% of C12-C18E05. The partitioning coefficient of the supernatant was 3.5 with a volume 
ratio of 0.2. Using 1.1 % (w/v) NaCl the partitioning coefficient could be increased to 4.3 
with a lower volume ratio of 0. 14. 

Example 19 

1 0 Construction of an E. coli strain expressing a fusion protein HFBI-dCBD, containing 
hydrophobin I and double cellulose binding (CBD) domains 

A 280 bp DNA fragment containing a modified cbh2 linker region followed by the coding 
region of hfbl from Ser-23 to the STOP codon was amplified by PCR using the plasmid 

15 pAROl (Nakari-Setala et ai, 1996) as a template. The 5' primer was 5' TCT AGC AAG 
CTT GGC TCT AGT TCT GGA ACC GCA CCA GGC GGC AGC AAC GGC AAC 
GGC AAT GTT TGC (SEQ ID 14) and the 3' primer was 5' TCG TAC AAG CTT TCA 
AGC ACC GAC GGC GGT (SEQ ID 15). The sequences in bold in the 5' and 3' primers 
encode the modified CBHII linker (Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly) and a 

2 0 translationai STOP, respectively, and the underlined AAGCTT in both primers is a Hindlll 
site. The PCR fragment was purified from agarose gel, digested with Hindlll and ligated to 
HindlEI digested and SAP treated (Shrimp Alkaline Phosphatase, USB) pSP73 resulting in 
plasmid pTNS9. 

2 5 For subsequent cloning of the modified CBHII linker-HFBI fragment to an E. coli expression 

vector, pTNS9 was digested with Hindlll and the proper fragment was purified from agarose 
gel. This Hindlll fragment was cloned to Hindlll digested and SAP treated (Shrimp Alkaline 
Phosphatase, USB) B599 resulting in pTNS13 (Figure 12). The E. coli expression vector 
B599 is essentially the same as the one described by Under et al. (1996) except that it is 

3 0 missing a STOP codon at the end of the protein coding sequence. It carries the coding 

sequence for a fusion protein containing CBHII CBD (41 N-terminal residues of CBHII) and 
CBHI CBD linked together via CBHI linker region (CBHI linker and CBD are the last 57 
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residues in CBHI). The expression and secretion of the fusion protein in B599 is under the 
cotrol of toe promoter and pelB signal sequence (Takkinen et aL, (1991). pTNS13 expression 
vector thus carries the coding region for a fusion protein of double CBD and HFBI linked in 
frame via the Gly-Ser-Ser-Ser-Gly-Thr-Ala-Pro-Giy-Gly peptide. This vector also contains 
5 the amp gene for selection of E. coli transformants. pTNS13 plasmid was transformed into E. 
coli strain RV308 (su-, A/acX74, ga/ISII::OP308, strA) and this strain was used for 
production of the fusion protein. 

Example 20 

1 0 Separation of HFBI-dCDB molecules expressed in R coli in ATPS 

dCBD-HFBI was produced in E. coli strain RV 308 transformed with pTNS13 plasmid as 
described above. The inoculum of RV308/pTNS13 was grown to the exponential growth 
phase in LB medium containing ampicillin (0. 1 g/1) and 1% glucose. Fermentation was carried 

1 5 out using mineral salt medium described by Pack et aL (1 993) with glucose (feed) in 1 0 litre 
fermenter. During cultivation temperature was maintained at 28 °C and pH was controlled at 
6.8 with NH4OH. Cell growth was monitored by measuring OD 60 o and dry weight of 
biomass. The culture was induced with 50 (final concentration) IPTG (isopropyl-P-D- 
thiogalactopyranoside) at late-exponential growth phase (OD6oo=50-60) to promote fusion 

2 0 protein production. 

Two-phase separation analysis of dCBD-HFBI protein was performed using culture filtrate 
and 5% detergent in the total volume of 40 ml. Results from Western blotting showed that 2- 
phase separation with 5 % detergent in the standard way was highly specific also for the 
2 5 dCBD-HFBI fusion. Strong signal was observed in the sample from the detergent phase 
compared to the sample from the bottom phase as shown in Figure 13. 
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Example 21 

Construction of yeast strains expressing HFBI-Flol fusion protein on the cell surface 

For construction of a HFBI-FLOl fusion protein expression cassette, hjbl (SEQ ID 1) 
5 coding region (from Ser-23 to the STOP codon) was amplified with PCR using pAROl 
(Nakari-Setala et al. 7 1996) as a template and as a 5' primer TCT AGC TCT AGA AGC 
AAC GGC AAC GGC AAT GTT (SEQ ID 16) and as a 3' primer TGC TAG TCG ACC 
TGC TAG C AG CAC CGA CGG CGG TCT G (SEQ ID 17). The underlined sequences in 
the 5' and 3' primers are Xbal and Nhel sites, respectively. The 0.225 bp PCR fragment was 
0 purified from agarose gel and ligated to pGEM-T vector (Promega) resulting in pTNSlO. 
The hjbl fragment was released from pTNSlO with Xbal and Nhel and ligated to pTNS15 
cut with the same restriction enzymes. Plasmid pTNSIS (Figure 14) is essentially the same as 
plasmid pBR- ADH 1 -FLO 1 L by Watari et al. 1994 except that a Nhel site in the pBR322 
backbone has been replaced by a BgUI site and a unique Xbal site is introduced by linker 
5 cloning in the unique AocI site preceding the putative signal sequence cleavage site. The 
resulting plasmid pTNS18 (Figure 15) contains the complete expression cassette for HFBI- 
FLOl fusion protein in which HFB1 substitutes the putative lectin domain from Ser-26 to 
Ser-3 19 in the yeast flocculin FLOl (SEQ ID 1 8). 

0 In the next step, yeast expression vector for production of HFBI-FLOl fusion protein was 
constructed. The expression vector used as a backbone in the construct is pYES2 
(Invitrogen) (SEQ ID 19) which is a high-copy episomal vector designed for inducible 
expression of recombinant proteins in S. cerevisiae. It carries GAL1 promoter and CYC1 
terminator sequences which regulate transcription, and 2\x origin of replication and URA3 

5 gene for maintenance and selection in the host strain. The plasmid pTNS 1 8 was digested with 
Hindlll and the released 3.95 kb fragment containing the expression cassette for HFBI-FLOl 
was purified from agarose gel and ligated to pYES2 digested with Hindm This ligation 
mixture was concentrated by standard ethanol precipitation. The ligation mixture should 
contain besides unligated fragments and uncorrect ligation products also molecules where the 

0 vector and insert are correctly ligated with each other to result in plasmid pTNS23 (Fig. 16) 
which carries the expression cassette for HFBI-FLOl operably linked to GAL1 and CYC J 
terminator sequences. 
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The above ligation mixture was transformed using the LiAc method of Gietz el al. (1992) 
into a laboratory S. cerevisiae strain H452 (wild type W303-1A; Thomas and Rothstem, 
1989). Transformant colonies able to grow on SC-URA plates were picked and streaked on 
5 selective plates. Nitrocellulose replicas were taken from the plates and treated for colony 
hybridization according to Sherman et al. (1983). To find those yeast colonies containing the 
pTNS23 plasmid, replicas were hybridized with digoxigenin labelled hfbl coding fragment 
after which an immunological detection was performed all according to the manufacturer 
(Boehringer Mannheim). Plasmids were recovered from several yeast colonies giving positivie 
10 hybridization signal by isolating total DNA and using this in electroporation of E. coli. 
Restriction mapping and sequencing were carried out to confirm that the P TNS23 plasmid in 
the yeast transformants was correct. One of the transformants carrying plasmid P TNS23 was 
chosen for further studies and was designated VTT-C-993 1 5 . The control strain for it is yeast 
strain H21 55 which carries the plasmid pYES2 in H452 background. 

15 

Example 22 

Separation of yeast cells expressing HFBI-Flol fusion protein in ATPS 

The Saccharomyces cerevisiae strain VTT-C-993 15 (vector pTNS23) and its control strain 
2 0 H2155 (vector pYES2) were cultivated on synthetic complete medium lacking uracil (SC- 
URA) (Sherman, 1991) with 2% galactose as the carbon source to give an A™ of 
approximately 4. Approximately 6.3 x 10 8 cells in their culture medium were taken to ATPS 
using 7% (w/v) C12-18E05 detergent (Agrimul NRE from Henkel) in a total volume of 5 ml. 
ATPS was carried out using strandard protocol. After phase separation by gravity settling, 
2 5 the top detergent phase was clearly turbid in the case of the strain VTT-C-99315 in contrast 
to the control strain whose detergent phase was clear (Figure 17). Samples were taken from 
the top phases and dilution series from 10" 1 to 10" 5 were prepared in 0.9% NaCl and plated on 
YPD plates. After incubation at 30°C the amount of yeast colonies were calculated showing 
at least 70 times more yeast colomes of the strain VTT-C-993 1 8 on YPD plates compared to 
30 the control strain. This clearly manifests that also in a system over-loaded with cells, 
separation to the detergent phase of cells expressing a hydrophobin on cell surface occurs. 
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Example 23 

Partitioning of EGIcore-HFBI fusion protein in hydrophobin-containing and non- 
containing pure systems 

5 The effect of free HFBI and HFBII hydrophobic on the partitioning of EGIcore-HFBI 
was investigated by comparing how efficiently a sample of purified EGIcore-HFBI in 50 
mM acetate buffer pH 5.0 was extracted to the detergent phase in the presence and 
absence of purified HFBI and HFBII. The extraction was followed by measuring the 
depletion of hydrolytic activity on a soluble substrate such as methylumbelliferyl 
10 cellobioside in the aqueous phase before and after extraction. When 0.7 g/l of either 
purified hydrophobin was present in the separation of 0.02 g/l of EGIcore-HFBI with 2% 
C12-18E05 at 30°C the extraction was affected in the following way: 93% of the 
protein was extracted when additional hydrophobin was not present, but with HFBII 
present 82% was extracted and with HFBI present 88% was extracted 

15 

Example 24 

Production of EGIcore-HFBI fusion proteins in T. reesei Ahfb2 strain for improved 
partitioning of the fusion protein in ATTS 

2 0 Trichoderma reesei strain QM9414 Ahfb2 (VTT-D-99726) was transformed essentially as 
described (Penttila et aL, 1987) using 10 ug of the plasmid pMQl 13 (described in Example 
1) together with 3 ug of the selection plasmid pTOC202 containing the amdS gene (Hynes et 
a/., 1983); Tilbum et al, 1983) of Aspergillus nidulans encoding for acetamidase. pMQl 13 
contains an expression cassette for production of EGIcore-HFBI fusion protein under the 

2 5 control of cbhl promoter and terminator sequences. 



The Amd+ transformants obtained were streaked two times onto plates containing acetamide 
(Penttila et al., 1987). Thereafter spore suspensions were made from transformants grown on 
Potato Dextrose agar (Difco). The production of the EGIcore-HFBI fusion protein was 
3 0 tested by slot blotting or Western analysis with EGI and HFBI specific antibodies from shake 
flask or microtiter plate cultivations carried out in minimal medium supplemented with Solka 
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flock cellulose. The spore suspensions of the clones producing fusion protein were purified to 
single spore cultures on selection plates containing acetatnide. To determine the best 
producers, production of the fusion protein was analyzed again from these purified clones as 
described above. 

5 

For partitioning experiments of the EGIcore-HFBI fusion protein in ATPS using the 
polyoxyethylene detergent C12-18E05 the best production strain obtained in this study and 
as control strains VTT-D-98691 (QM9414 strain producing EGIcore-HFBI) and VTT-D- 
74075 (QM9414) were cultivated in shake flasks on medium containing Solka flock cellulose 
10 as described in Example 4. 

Standard partitioning experiments as described in Example 5 are carried out with culture 
supernatants. After separation the volume ratio of the lighter and heavier phase is noted and 
the concentration factor for the fusion protein is calculated from it. Samples are also taken 
15 from the lighter and heavier phase and analysed with SDS-PAGE, Western blotting and 
activity measurements as described in Example 5. Partition coefficients (K) and yields (Y) are 
calculated as described in Example 5 . 



Example 25 

2 0 Construction of vectors for expression of EGIcore-HFBII and EGIcore-SC3 fusion 
proteins under the cbhl promoter of T. reesei 

For construction of an expression vector for production of EGIcore-HFBII fusion protein, 
hfbl (SEQ ID 20 ) coding region (from Ala- 16 to the STOP codon) was amplified by PCR 

2 5 using phfb2 (Nakari-Setala et al. 1997) as a template and as a 5' primer CGG AG G AGC 

JCG ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG GCT GTC TGC CCT 
ACC GG (SEQ ID 21) and as a 3' primer TCA TTGGATCCT TAG AAG GTG CCG 
ATG GC (SEQ ID 22). The sequence is bold in the 5' primer encodes for amino acids 413- 
425 of EGI and the underlined GAGCTC is a Sad site. The underlined GGATCC in the 3' 

3 0 primer is a BamHI site. The amplified fragment was digested with Sacl and BamHl and 

ligated to pMQl 13 cut with the same restriction enzymes. The resulting plasmid is pTNS32 
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(Figure 18) and it carries the coding sequence for EGIcore-HFBU fusion protein under the 
control of cbhl regulatory sequences (SEQ ID 5 and SEQ ID 7). 

For construction of an expression vector for production of EGIcore-SC3 fusion protein, SC3 
cDNA (SEQ ID 23) was ampified with PCR using cSC3/pUC20 plasmid as a template and as 
a 5' primer ACT ACA CGG A GG AGC TC G ACG ACT TCG AGC AGC CCG AGC 
TGC ACG CAG GGT GGC CAC CCG GGC (SEQ ID 24) and as a 3' primer TCG TAC 
GGA TCC TCA GAG GAT GTT GAT GGG (SEQ ID 25). The sequence is bold in the 5' 
primer encodes for amino acids 413-425 of EGI and the underlined GAGCTC is a Sad site. 
The underlined GGATCC in the 3 f primer is a BamHI site. The amplified fragment was 
digested with Sad and BamHI and ligated to pMQ103 (described in Example 1) cut with the 
same restriction enzymes. The resulting plasmid is pTH4 (Figure 19) and it carries the coding 
sequence for EGIcore-SC3 fusion protein under the control of cbhl regulatory sequences 
(SEQ ID 5 and SEQ ID 7). 

The cSC3/pUC20 plasmid contains the 41 1 bp SC3 cDNA from the translational start site to 
the STOP codon in a pUC20 vector. The translational start site has been constructed to a 
Ncol site and a BamHI site has been added after the translational STOP codon. 

Example 26 

Construction of a vector for expression of a HFBI-dCBD fusion protein containing 
hydrophobin I and double cellulose binding domains (CBD) under the cbhl 
promoter of T. reesei 

For construction of an expression cassette for production of HFBI -dCBD fusion protein 
under cbhl promoter, the protein coding region of hfbl was amplified with PCR using 
pEAlO (Nakari-Setala et al. Eur. J. Biochem. (1996) 235:248-255) as a template. GGA 
ATT CCG CGG ACT GCG CAT CAT GAA GTT CTT CGC CAT CGC C (SEQ ID 
26) was used as a 5 7 primer in the PCR and TGA ATT C CA TAT G TT AGG TAC CAC 
CGG GGC CCA TGC CGG TAG AAG TAG AAG CCC CGG GAG CAC CGA CGG 
CGG TCT GGC AC (SEQ ID 27) as a 3' primer. The sequence in bold in the 5 ? primer 
is 16 bp of cbhl promoter adjacent to translational start site of the corresponding gene 



WO 00/58342 PC T7FI 00/0 0249 

38 

and the underlined CCGCGG is a Kspl site. The underlined and bold sequences in the 3 s 
primer are Ndel and Asp718 sites, respectively. The sequence in Italics in the 3' primer 
encodes for a Methionine-containing linker (PGASTSTGMGPGG) The resulting 
fragment of 370 bp was digested with Kspl and Ndel and ligated to pAMHUO 
5 (Nevalainen, K.M.H., Penttila, M.E., Harkki, A., Teeri, T.T. and Knowles, J. (1991) In 
Molecular Industrial Mycology. Eds. Leong, S.A. and Berka, R.. Marcel Dekker. New 
York) digested with same restriction enzymes. The resulting plasmid is pTNS29-2Asp. 

For further cloning steps, an Asp718 site was removed from the pTNS29-2Asp 
1 0 polylinker. The vector was digested with Sad and BamHI and the cleaved vector ends 
were blunted with T4 DNA polymerase and ligated together. The resulting vector 
pTNS29 is missing the Sad, Asp718 and Smal sites present in the pTNS29-2Asp 
polylinker. 

15 In the next step, a double cellose binding domain (dCBD) was amplified with PCR using 
pTNSll as a template. pTNSll carries the coding sequence for a fusion protein 
containing T. reesei CBHII CBD (41 N-terminal residues of CBHII) and CBHI CBD 
linked together via CBHI linker region (CBHI linker and CBD are the last 57 residues in 
CBHI). This DNA sequence originates from an E. coli expression vector B599 which is 

2 0 essentially the same as the one described by Under et al. (J. Biol. Chem. (1996) 
271:21268-21272). TGA ATT C GG TAC C CA GGC TTG CTC AAG CGT C (SEQ ID 
28) was used as a 5' primer in the PCR and TGA ATT CCATATGTC ACA GGC ACT 
GAG AGT AGT A (SEQ ID 29) as a 3' primer. The underlined sequences in the 5' and 3' 
primers are Asp718 and Ndel sites, respectively. The amplified fragment was digested 

2 5 with Asp718 and Ndel and ligated to pTNS29 digested with Asp718 and Ndel resulting 

in pTNS30. 

pTNS30 (Figure 20) expression vector thus carries the coding region for a fusion protein 
consisting of HFBI and double CBD linked in frame via the Methionine linker peptide 

3 0 (PGASTSTGMGPGG). Expression of the fusion protein is regulated by the cbhl 

transcriptional control sequences. The expression cassette may be released from the 
plasmid with EcoRI and SphL 
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Example 27 

Construction of vectors for expression of HFBI-singlc chain antibody fusion proteins 
under the cbhl promoter of T. reesei 

5 An expression vector was constructed for production of a fusion protein consisting of T. 
reesei HFBI protein in the N-terminus and in the C-terminus a single chain antibody 
recognizing a small molecular weight derivative of diarylalkyltriazole (ENA5ScFv). 
Production of the fusion protein is under the cbhl regulatory sequences. For construction of 
HFBI-ENA5ScFv fusion protein, pENA5ScFv was digested with Ncol and Xbal. The 

1 0 fragment containing the enaSscp gene and the histidine tail (6 x His) was bhit-end cloned to 
pTNS29 resulting in pTHl (Figure 21). pENASScFv vector carries the coding region for 
ENA5 single chain antibody consisting of the variable domains of the heavy and light chains 
connected via a glycine serine linker and a 6 x histidine tag at the C-terminal end. 
Transcription and secretion of the single chain antibody are under control of the tac promoter 

15 and pelB signal sequence, respectively (Takkinen et al. y 1991). pTNS29 vector carries the 
hfbl coding region of T. reesei followed by a linker sequence 
(ProGlyAlaSerThrSerThrGlyMetGlyProGlyGly) under the control of cbhl promoter and 
terminator sequences. 

2 0 For construction of HFBI-ENA5ScFv fusion protein with a thrombin cleavage site in the 

linker peptide, enaSscjv coding region (from Ala-23 to the STOP codon) and a peptide linker 
containing the thrombin cleavage site (Gly Thr Leu Val Pro Arg Gly Pro Ala Glu Val Asn 
Leu Val) preceeding it was amplified with PCR using pENASScFv as a template and as a 5' 
primer GAA TTC GGT ACC CTC GTC CCT CGC GGT CCC GCC GAA GTG AAC 
2 5 CTG GTG (SEQ ID 30) and as a 3' primer TGA ATT C CA TAT G CT AAC CCC GTT 
TCA TCT CCA G (SEQ ID 31). The sequence in bold in the 5' primer encodes the first 6 
N-terminal residues of ENA5SCFV. The sequence in italics is a thrombin cleavage site and 
underlined GGT ACC is an Asp718 site. The sequence in bold in the 3' primer encodes the 6 
C-terminal residues of ENASScFv and the underlined CA TATG is a Ndel site. The 790 bp 

3 0 PCR fragment was purified from agarose gel and ligated to pTNS29 resulting in pTH2 

(Figure 22). 
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Example 28 

Construction of a vector for production of class I hydrophobin SC3 of Schizophyllum 
commune in Ttichoderma reesei for ATPS 

5 AT. reesei strain producing class I hydrophobin SC3 of S. commune was constructed. For 
that purpose an expression vector carrying the SC3 cDNA under the control of hfbl 
promoter and hfbl terminator of T. reesei was constructed. 

The hfbl terminator (SEQ ID 32) was amplified with PCR using as a template pEAlO 
1 0 (Nakari-Setala et al. 1996) and as a 5' primer GAC CTC GAT GCC CGC CCG GGG TCA 
AG (SEQ ID 33) and as a 3' primer GTC GAC ATT TCA TTT TAC CCC CCT CG 
(SEQ ID 34). The underlined sequences in the 5' and 3' primers are Sad and Sail sites, 
respectively. The PCR fragment was cut with Sad and Sail so that the Sad site was 
blunted with Klenow. The fragment was cloned into cSC3/pUC20 (described in Example 
1 5 25) vector digested with Sail and Bamffl where the BamHI was blunted with Klenow. In 
the next step, the hfb2 promoter (SEQ ID 35) was amplified with PCR using as a template 
pTNS8 (Nakari-Setala et al. 1997) and using as a 5' primer AAG CTT GCA TGC CTG 
CAT CC (SEQ ID 36) and as a 3' primer CCA TGG TGA AAG GTG GTG ATG GTT 
GG (SEQ ID 37). The underlined sequences in the 5' and 3' primers are Hindlll and 
20 Ncol sites, respectively. The PCR fragment was digested with Hindlll and Ncol and 
cloned in front of the SC3 cDNA in the plasmid obtained in the previous step and cut 
with the same restriction enzymes. The resulting plasmid is pKS2 (Figure 23). 

Example 29 

25 Construction of T. reesei strains producing EGIcore-HFBH, EGIcore-SC3, HFBI- 
dCBD and HFBI-single chain antibody fusion proteins, and SC3 hydrophobin 

Trichoderma reesei strains VTT-D-74075 (QM9414), VTT-D-86271 (Rut-C30) and VTT- 
D-99676 (Rut-C30 6hfb2) were co-transformed essentially as described (Penttila et al, 
30 1987) using 3-13 ug of the plasmids pTNS32 ; pTHl, pTH2, P TH4, pTNS30 and pKS2 and 
as the selection plasmids 1-3 ug P ToC202 or pAR021. P ToC202 carrying the amdS gene 
(Hynese/a/., 1983; Tilburn et al., 1983) of A. nidulans and pARQ21 carrying the £. coli hph 
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gene have been described in Example 3. The Amd+ and Hyg+ transformants obtained were 
streaked three times onto plates containing acetamide and hygromycin, respectively (Penttila 
et aL, 1987). Thereafter spore suspensions were made from transformants grown on Potato 
Dextrose agar (Difco). 

5 

The production of the fusion proteins EGIcore-HFBH, EGIcore-SC3, HFBI-dCBD and 
HFBI-ENA5ScFv and the SC3 hydrophobin was tested by slot blotting or Western analysis 
with EGI, SC3, CBD and HFBI specific antibodies from shake flask or microtiter plate 
cultivations carried out in minimal medium supplemented with either lactose or Solka flock 
0 cellulose. The spore suspensions of the fusion protein producing clones were purified to 
single spore cultures on selection plates (containing either acetamide or hygromycin). To 
determine the best producers, production of the fusion proteins was analyzed again from 
these purified clones as described above. 

5 T. reesei strains selected for further cultivations are X46A (pTNS32, host QM9414), VTT- 
D-00793 (pTH4, host Rut-C30 tshfb2\ VTT-D-99727 (pTNS30, host Rut-C30 tshft>2\ 
VTT-D-00791 (pTHl, host Rut-C30) and VTT-D-00792 (pKS2, host Rut-C30 tshfbl). 
These strains were cultivated as described in Example 4 and the culture supernatants were 
subjected to ATPS. 

0 

Example 30 

Recovery of the protein product from the enriched detergent phase after ATPS by 
isobutanol or other solvents 

5 After phase separation has occurred and the hydrophobin or the hydrophobin fusion 
protein is enriched in the detergent phase (the enriched phase), the protein can be 
recovered to an aqueous buffer by addition of isobutanol or other solvent. For example a 
series, of identical extractions of 0.05 g/1 HFBI in 50 mM acetate buffer was made with 
2% Berol 532. To separate tubes 10% final concentration of either isobutanol, n-amyl 

0 alcohol, octanol, or octane was added, and the fraction of hydrophobin recovered to the 
aqueous phase was analyzed. It was found that 100% was recovered with isobutanol, 
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89% with n-amyl alcohol, 81 % with octanol and 70 % with octane. Fractions were 
analyzed by HPLC as described in Example 38. 

5 Example 31 

Separation of EGIcore-HFBII fusion protein in ATPS. 

T.reesei strain X46A was grown in shake flasks on 3% lactose medium as described in 
Example 4. C12-18E05 detergent was added to a final concentration of 5% to the 
culture supernatant. After mixing the system was allowed to settle and the enriched 

10 detergent phase was subjected to further extraction with an equal volume of isobutanol. 
After buffer exchange on Biogel P-6 (Bio-Rad, USA) gel filtration media, the extracted 
protein was analysed by ion exchange chromatography using a Mono Q (Amersham 
Pharmacia, Sweden) column equilibriated with 1 5 mM acetate and eluted using a linear 
gradient of NaCL Separation of the EGIcore-HFBII fusion protein to the detergent phase 

1 5 was verified by analyzing the eluted peak fraction from the Mono Q chromatography run 
by its activity on 4-methylumbeUiferyl cellobioside as well as its re-extraction by ATPS. 

Example 32 

Separation of EGIcore-SC3 fusion protein in ATPS 

20 

Culture filtrate of T.reesei strain VTT-D-00793 producing EGIcore-SC3 fusion protein 
was subjected to APTS by adding 5% final concentration of C12-18EOS. After the 
settling, the phases were separated and the fusion protein enriched detergent phase was 
further treated with an equal volume of isobutanol which removes the detergent to the 
2 5 isobutanol and leaves the protein in the aqueous phase. The resulting fusion protein 
enriched aqueous phase was then desalted on Biogel P-6 (Biorad, USA). Separation of 
fusion protein to the detergent phase was verified by analyzing the enzymatic activity of 
the EGIcore fusion partner on 4-methylumbelliferyl cellobioside in that phase. 
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Example 33 

Separation of HFBI-dCBD fusion protein in ATPS 

T.reesei strain VTT-D-99727 producing HFBI-dCBD fusion protein was grown on 
lactose-containing medium as described in Example 4. Berol 532 was added to a final 
concentration of 2 % to 500 ml of culture supernatant. The mixture was allowed to settle 
and the phases separated in a separation funnel. The enriched top phase (10 ml) was 
extracted with an equal volume of isobutanol and 50 mM acetate buffer pH5. The fusion 
protein was selectively enriched to the buffer in the separation as shown by SDS gel 
electrophoresis (Figure 24). 

Example 34 

Recovery of the HFBI-dCBD fusion partners after chemical cleavage 

HFBI-dCBD protein produced by the strain VTT-D-99727 has a methionine 
(PGASTSTGMGPGG) designed in the linker region between the HFBI and the dCBD, 
which would enable the recovery of the native HFBI and dCBD after chemical cleavage with 
CNBr. 

The HFBI-dCBD fusion protein was purified as described in Example 33 . The resulting water 
phase (-108 ml) still contained a small amount of CBH1 and free hydrophobin in addition to 
the HFBI-dCBD fusion (Figure 24). Further purification of the sample was performed by 
chromatography. The sample was desalted on a Biogel P-6 column equilibrated with 50mM 
sodium acetate buffer pH 5.5, diluted 1 + 3 with water and applied on a CM-Sepharose FF 
column equilibrated with lOmM sodium acetate buffer pH 5.5. The CBHI protein was found 
in the flow through and the purified HFBI-dCBD protein was eluted with 0.2M NaCI. 

A sample of purified HFBI-dCBD protein was evaporated in speed-vac almost to dryness. 
CNBr cleavage was performed by adding a solution of CNBr in 0.1 M HC1 (5mg/ml) to the 
evaporated sample in 5 times weight excess. The reaction was allowed to continue for 24h at 
room temperature (in dark). A 1 0 times volume of water was added and the sample was 
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evaporated in speed-vac almost to dryness. The evaporated sample was diluted in 50mM 
Tris-HCl pH 7 and the cleavage was confirmed by HPLC analysis as described in Example 
37. 

5 The treatment was further optimised by using different concentrations of HC1 and incubation 
times. The final CNBr treated sample was subjected to ATPS as described in Example 33. 
HFBI was found in the top-phase and the dCBD in the bottom phase, thus leading to a 
separation of both fusion partners. 

1 0 Example 35 

Separation of HFBI-single chain antibody fusion protein in ATPS 

Strain VTT-D-00791 was cultivated for 7 days on medium containing 10 g/1 potassium 
phtalate, 15 g/l KH 2 P0 4 , 5 g/L (NHUfeSC^ and 3% lactose and 0.2% peptone. 30 ml of 
1 5 culture medium was subjected to ATPS using 4% C12-18-E05 detergent as described in 
Example 5. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol. Proteins from the water phase after detergent extraction and the bottom phase of 
the isobutanol extracted detergent phase were precipitated with trichloroacetic acid (final 
concentration 10%), resuspended in SDS-PAGE sample buffer (taking into acount the 

2 0 concentration factor of the isobutanol extracted detergent phase) and analysed by Western 

blotting using HFBI specific antibodies. The fusion protein runs as a dimer (approximately 70 
kDa ) in SDS-PAGE under the conditions used as seen in Figure 25. The water phase sample 
still contained a small amount effusion protein. Nevertheless, separation of the fusion protein 
can be obtained by ATPS. 

25 

Example 36 

Separation of SC3 hydrophobin in ATPS 

SC3 hydrophobin producing strain VTT-D-00792 was cultivated in shake flasks on lactose- 

3 0 containing medium as described in Example 4. Soluble secreted proteins produced by the 

fungus were precipitated with trichloroacetic acid. Precipitated proteins were solubilized in 
trifluoroacetic acid to dissociate the SC3 aggregates whereafter the acid was evaporated with 
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a stream of air. The treated proteins were solubilized in 2 ml water containing 2.5 % of the 
detergent C12-18E05. Phase separation was carried out as described in Example 5. Samples 
were taken from the lighter and heavier phase and analysed by Western blotting using SC3 
specific antibodies. On the basis of this analysis, SC3 hydrophobin partitions to the detergent 
5 phase (Figure 26). 

Example 37 

Separation of nisin in ATPS 

10 1 mg of pure nisin (Sigma) (equals to 1000 IU) in 50 mM sodium acetate buffer pH 5 was 
subjected to ATPS using 2% of the detergent C12-18E05 at 30°C, and the phases were 
allowed to settle. After removal of the bottom phase, the detergent phase was extracted by 
isobutanol which removes the detergent to the isobutanol and leaves nisin in the aqueous 
phase. Separation of nisin to the enriched aqueous phase was assayd testing nisin 

15 bioactivity as described by Qiao et al. 1996. Comparison of the halos on the assay plate 
produced by the enriched phase and the nisin controls indicated that nisin had separated 
to the detergent phase with a concentration factor of approximately 5. 

2 0 Example 38 

Preparative purification of HFBI and HFBII in ATPS 

For preparative purposes, HFBI and HFBI were extracted from 0.5 1 of culture 
supernantants from cultivations carried out on media containing glucose or cellulose as 

2 5 described in Example 4. 2% (w/w) detergent was added to the culture supernatant which 

was mixed and then allowed to settle in a separation funnel at 20°C if C11E02 (Berol 
532) was used and 30°C if C12-18E05 (Henkel) was used. The detergent phase 
(enriched phase) was collected and mixed with an equal volume of isobutanol. When 
C11E02 was used an equal volume of 50 mM acetate buffer was also added. The 

3 0 remaining culture supernatant is the depleted phase. To follow the purification, analytical 

HPLC was run on samples from each step (Figure 27and 28). On the basis of HPLC 
analysis, both HFBI and HFBII partitioned well to both of the detergents. When 
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C11E02 was used, K values of >1000 and >78 were obtained for HFBI and HFBI1, 
respectively. 

Water phase after isobutanol extraction was collected and fiirher purified on a Vydac 
5 1x20 cm semipreparative C4 column (Vydac, USA) eqilibriated with 0.1% trifluoro 
acetic acid in water and eluting with a linear gradient of 0.1% trifluoro acetic acid in 
acetonitrile. The proteins were then lyophilized. In analytical runs a 4.6 mmx20 cm 
Vydac C4 column was used. 

1 0 Example 39 

Screening for separation conditions using purified HFBI and HFBII 

HFBI and HFBII which were purified from culture supernatants in ATPS, further 
purified by preparative HPLC and lyophilized as described in Example 38 were dissolved 

15 in water to a concentration of 0.5 mg/ml. The stock solutions were then diluted with the 
buffer appropriate for the experiment. 50 mM acetate buffer pH5, 50 mM glycine pH 3, 
50 mM HEPES pH 7 was used for pH screening and NaCl or (NH4)2S0 4 in acetate 
buffer was used for ionic strength experiments. The surfactant was weighed with the 
appropriate amount of hydrophobin solution and incubated in 0, 20, 30, 40°C and the 

2 0 amount of hydrophobin in the depleted phase determined. Partitioning coefficients were 
calculated from the change in concentration and the volume ratios and are presented in 
the tables below. 
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Table presenting K- values for the separation of hydrophobins with different surfactants 
and temperature. 





Temperature 


Henkel 


Berol 532 


Berol 266 


Triton xl 14 


HFBI 


0°C 


0,1 


368 


NP 


NP 




20°C 


0,4 


1961 


NP 


NP 




30°C 


85 


2970 


NP 


0,9 




40°C 


148 


682 


28 


69 


HFBII 


0°C 


NP 


76 


NP 


NP 




20°C 


1,0 


139 


NP 


NP 




30°C 


67 


194 


NP 


4,0 




40°C 


44 


102 


19 


44 



5 NP = no phase separation 



Table presenting the effect of different salt concentrations at 40°C on K-values 







Henkel 


Berol 532 


Berol 266 


Triton xl 14 


HFBI 


50 mM buffer 


244 


578 


13 


39 




+ 1 M NaCl 


239 


33 


198 


241 




+1 M (NH4) 2 S0 4 


0,9 


3,2 


176 


612 














HFBII 


50 mM buffer 


36 


80 


16 


58 




+ 1 M NaCl 


14 


87 


61 


38 




+1 M (NH4) 2 S0 4 


3,1 


351 


95 


35 



Table presenting the K-values for the partitioning of hydrophobins in Berol 532 at 
1 5 different conditions. 





HFBI 0°C 


HFBI 20°C 


HFBII 0°C 


HFBII 20°C 


50 mM acetate pH5 


298 


614 


76 


127 


+1 M NaCl 


614 


3781 


131 


199 


+ 1M(NH4) 2 S04 


931 


44 


203 


139 


pH3 


614 


1881 


73 


160 


! pH7 


361 


614 


54 


126 
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Example 40 

Purification of a new putative T. reesei hydrophobia on the basis of its separation in 
ATPS, and cloning of the corresponding gene 

5 T. reesei strain VTT-D-99726 (QM9414 Ahfb2) was cultivated on lactose-based 
medium in 15 fermentor as described in Example 4. After cultivation, 1 1 of the culture 
filtrate was subjected to ATPS with 5% polyoxyethylene detergent C12-18E05. After 
phase separation at 30°C, detergent phase was separated from the depleted aqueous 
phase and analysed by SDS-PAGE. The analysed sample contained a protein which 

10 was about 7.5 kDa. In addition, some larger proteins were present. These were 
removed by hydrophobic interaction chromatography on a Phenyl-Sepharose FF 
column equilibriated with 50 mM sodium acetate - 1 M (NHU^SCU buffer. The 
proteins were eluted with descending (NHU^SC^ salt gradient. Fractions containing 
the ca. 7.5 kDa protein were pooled and concentrated, and analysed by mass 

1 5 spectrometry. This revealed that the purified protein was degraded to three peptides 
(2486, 2586 and 2709 Da), which were held together by disulphide linkages. The N- 
terminal amino acid sequence from the 2486 Da peptide was 
ANAFCPEGLLYTNPLCCDLL, which on the basis of position of cysteines and on 
sequence comparison to known hydrophobins is typical to a hydrophobin. 

20 

Degenerated primers were designed on the basis of this amino acid sequence as well 
as on the basis of the sequence obtained from the 2586 Da peptide. These primers 
were used in RT-PCR (RobusT RT-PCR Kit, Finnzymes) with RNA isolated from the 
same cultivation from where the protein was purified. The 140 bp fragment obtained 
2 5 from the RT-PCR reaction was sequenced. The obtained sequence contained a part 
that was coding for the 2486 Da peptide used to design the PCR primer confirming 
that the PCR product corresponds to the purified protein. 
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Example 41 

Purification of EGlcore-HFBI in ATPS in pilot scale 

The strain VTT-D-99702 producing EGlcore-HFBI fusion under the cbhl promoter was 
5 cultivated in pilot scale fermentor in 1200 L on minimal medium containing 4% lactose, 
0.4 % peptone and 0.1% Yeast Extract for 4 days. Cultivation temperatures were 28°C 
or alternatively the temparature was changed during the cultivation from 27°C to 22°C in 
a step- wise manner. Production level of the fusion protein was a few grams per liter. 
After the end of the cultivation, the mycelium was separated by means of a rotating 
1 0 vacuum drum filter using Celite 535 diatomaceous earth as filter aid. 



In separation 1, 1 100 L of the supernatant was transferred into the meanwhile cleaned 
bioreactor, adjusted to the separation temperature of 24,7 °C and mixed with salt and 
detergent to give 0,15 M ammonium dihydrogen phosphate and 4,1 % of detergent CI 2- 

15 18E05. The phases were left to separate by gravity settling and the heavier phase 
removed through the bottom valve. A parallel experiment in 10 ml volume was carried 
out to investigate the effect of scale up on the separation of the fusion protein. The 
partition coefficient and the concentration factor were equal for the 10 ml and 1200 L 
separation within the measurement error. The separated detergent phase was re- 

2 0 extracted using the detergent phase from the first extraction by replacing half of the 
volume of the bottom phase by tap water. The salt concentration was adjusted to 0,25 M 
ammonium dihydrogen phosphate at 30°C. K and Y values of the separations are 
presented in the below Table. 

2 5 Alternatively in separation 2, 10 ml of drum filtrated culture medium samples were 
subjected to ATPS at 25°C using 4.1% C12-18E05 detergent and 1.15 M (NH4)H 2 P0 4 . 
K and Y values after one separation step are presented in the Table below. 
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Table presenting extraction experiments of culture supernatants from different pilot scale 
cultivations 



Separation 


Cultivation 
temperature 


Concentration factor 


K 


Y [%] 


Separation 1 


27°C -> 22°C 


3.4 


3.8 


59 


Re-extraction 






8 


70 


Separation 2 


29°C 


2.8 


8.3 


81 



5 

Example 42 

Separation of EGI, EGIcore-HFBI fusion protein and HFBI hydrophobin in robust 
micelle/polymer systems 

10 Separations were performed with purified samples. HFBI was purified by two-phase 
separation and HPLC as decribed in Example 38. EGIcore-HFBI was purified by two-phase 
extraction as decribed in Example 5 followed by desalting on a Biogel P-6 column 
equiliberated with 20mM sodium acetate puffer pH 6 containing 150mM NaCl. The purified 
proteins were subjected to ATPS using different micelle/polymer systems. 

15 

The table below presents partitioning coefficients of pure EGI, HFBI and EGIcore-HFBI 
at constant tie-line length, in the different systems, Triton X-114/water system (4.1 % 
wt.) at 31.5° C, Triton X-100/Reppal/water system (8.1 % wt. / 8.2 % wt.) and Triton 
X-114/Reppal/water system (5.0 % wt. / 4.0 % wt.) at 21° C. The standard deviation is 
2 0 shown within the brackets. The systems were buffered with 25 mM sodium acetate, pH 
4.0. K > 1 is equivalent to a preferred protein partitioning to the micelle rich phase. 





EGI 


HFBI 




EGIcore-HFBI 




Kegj 


Khfb 


Keoihfb Yield (%) 
top phase 


TritonX-114 


0.6 (0.1) 


21.5 (1.5) 


8.4 (1.0) 


85 


Triton X- 1 00/Reppa! 


0.9 (0.1) 




1.7 (0.1) 


56 


Triton X-114/Reppal 


0.7 (0.1) 


11.0 (1.1) 


15.4 (2.9) 


91 
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Claims 

1. A process for partitioning of proteins or cells in aqueous two-phase systems (ATPS), 
comprising the steps of 

a) in order to obtain a fusion protein or cell, combining a protein or a cell of interest 
to a targeting protein selected from the group consisting of amphipathic and hydrophobic 
proteins having the ability to partition in ATPS and to cany said protein or cell of 
interest into one of the phases, and 

b) subjecting said fusion protein or cell carrying the targeting protein to an ATPS 
separation. 

2. The process according to claim 1, wherein the targeting protein is a hydrophobin or 
hydrophobin-like protein or a part thereof. 

3. The process according to claim 2, wherein the hydrophobin is a Trichoderma 
hydrophobin or a part thereof 

4. The process according to claim 3, wherein the Trichoderma hydrophobin is HFBI, 
HFBII or SRHT ,or a part thereof. 

5. The process according to claim 1, wherein the amphipathic or hydrophobic proteins 
or parts thereof form aggregates. 

6. The process according to any one of claims 1 to 5 for partitioning cells in ATPS, 
wherein in step a) of claim 1 the combination of the cells of interest to the targeting 
protein comprise bringing said targeting protein onto the surface of said cells. 

7. The process according to claim 6, wherein the cells are yeast cells. 

8. The process according to claim 6 wherein the cells are spores. 

9. The process according to any one of claims 6 to 8, wherein the targeting protein is 
fused to a protein which brings the targeting protein onto the surface of the cell. 
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10. A fusion protein, comprising a hydrophobin or hydrophobin-like protein as defined in 
any one of claims 2 to 5 fused to a protein of interest. 

1 1 . The fusion protein according to claim 10, wherein the protein of interest is a cell- 
5 bound protein or a part thereof. 

12. The fusion protein according to claim 10, wherein the protein of interest is an 
extracellular protein or a part thereof. 

10 1 3 . The fusion protein according to claim 1 2, wherein the extracellular protein is an 

extracellular protein of Trichoderma, selected from the group consisting of cellulases, 
hemicellulases and proteases. 

14. The fusion protein according to claim 10, wherein the protein of interest is an 
1 5 antibody protein or a part thereof. 

15. The process according to claim 1, wherein the targeting protein is fused to the 
protein of interest according to any one of the claims 10 to 14. 

20 16. A recombinant organism producing a fusion protein according to any one of claims 
10 tol4. 

17. The recombinant organism according to claim 16, wherein the organism has been 
genetically modified to be capable of producing a fusion protein according to any one of 

25 claims 10 to 14. 

18. A recombinant DNA molecule, comprising a DNA molecule encoding a fusion 
protein according to any one of claims 10 to 14. 

30 19. A process for producing a fusion protein according to any one of the claims 10 to 14 
with recombinant organisms, comprising the steps of 

a) transforming the organism with DNA molecules enabling expression of such proteins, 
and 



WO 00/58342 ^ PC T/FI 00/002 49 

b) recovering such protein from the culture of the recombinant organism. 



20. The process according to any one of claims 1 to 9 and 15, wherein the aqueous two- 
phase system is selected from the group consisting of PEG/salt, PEG/Dextran and 
PEG/starch systems or derivatives thereof, detergent-based aqueous two-phase systems 
and thermoseparating polymer systems. 

21. The process according to claim 20, wherein the detergent-based ATPS comprises a 
detergent which is selected from the group consisting of nonionic and zwitterionic 
detergents. 

22 The process according to claim 20, wherein the thermoseparating polymer system 
comprises a polymer which is selected from the group consisting of polyethylene- 
polypropylene copolymers. 

23. The process according to any one of claims 1-9 and 15, wherein the protein or cell of 
interest is separated from a suspension containing cells or cell extracts. 

24. A process for separating hydrophobins or hydrophobin-like proteins or parts thereof 
in aqueous two-phase systems, comprising the steps of 

a) mixing solutions containing said hydrophobin, hydrophobin-like protein or parts 
thereof wi th the phase forming chemicals, and 

b) carrying out ATPS separation, 

wherein the aqueous two-phase system is as defined in any one of claims 20 to 22. 
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SEQUENCE LISTING 

<110> Valtion teknillinen tutkimuskeskus 

<12Q> Process for partitioning of molecules 

<130> 31805 

<140> 
<141> 

<160> 42 

<170> Patencln Ver. 2.2 

<210> 1 
<211> 428 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> intron 
<222> (167) . . (236) 

<220> 

<221> intron 
<222> (3235 . . (386) 

<220> 

<2 23> Coding sequence of hfbl 
<400> 1 

atgaagttct tcgccatcgc cgctctcttt gccgccgctg ccgttgccca gcctctcgag 60 
gaccgcagca acggcaacgg caatgtttgc cctcccggcc tcttcagcaa cccccagtgc 120 
tgtgccaccc aagtccttgg cctcatcggc cttgactgca aagtccgtaa gttgagccat 180 
aacataagaa tcctcttgac ggaaatatgc cttctcactc ctttacccct gaacagcrtc 240 
ccagaacgtt tacgacggca ccgacttccg caacgtctgc gccaaaaccg gcgcccagcc 3 00 
tctctgctgc gtggcccccg ttgtaagttg atgccccagc tcaagctcca gtctttggca 360 
aacccattct gacacccaga ctgcaggccg gccaggctct tctgtgccag accgccgtcg 420 
gtgcttga 428 

<210> 2 
<211> 78 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 2 

tcgggcacta cgtgccagta tagcaacgac tactactcgc aatgccttgt tccgcgtggc 60 
tctagttctg gaaccgca 78 



WO 00/58342 PCT/FIOO/00249 

<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 3 

tcgtacggat cctcaagcac cgacggcggt 3 0 



<210> 4 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 4 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagag caacggcaac 6 0 

ggc 6 3 



<210> 5 
<211> 2211 
<212> DNA 

<213> Trichoderma. reesel 
<220> 

<221> promoter 

<222> (1) . . (2211) 

<22 3> chhl promoter sequence 

<400> 5 

gaattctcac ggtgaatgta ggccttttgt agggtaggaa ttgtcactca agcaccccca 60 

acctccatta cgcctccccc atagagttcc caatcagtga gtcatggcac tgttctcaaa 120 

tagattgggg agaagttgac ttccgcccag agctgaaggt cgcacaaccg catgatatag 18 0 

ggtcggcaac ggcaaaaaag cacgtggctc accgaaaagc aagatgtttg cgatctaaca 24 0 

tccaggaacc tggatacatc catcatcacg cacgaccact ttgatctgct ggtaaactcg 300 

tattcgccct aaaccgaagt gcgtggtaaa tctacacgtg ggcccctttc ggtatactgc 360 

gtgtgtcttc tctaggtgca ttctttcctt cctctagtgt tgaattgttt gtgttgggag 420 

tccgagctgt aactacctct gaatctctgg agaatggtgg actaacgact accgtgcacc 480 

tgcatcatgt atataatagt gatcctgaga aggggggttt ggagcaatgt gggactttga 54 0 

tggtcatcaa acaaagaacg aagacgcctc ttttgcaaag ttttgtttcg gctacggtga 600 

agaactggat acttgttgtg tcttctgtgt atttttgtgg caacaagagg ccagagacaa 660 

tctattcaaa caccaagctt gctcttttga gctacaagaa cctgtggggt atatatctag 72 0 

agttgtgaag tcggtaatcc cgctgtatag taatacgagt cgcatctaaa tactccgaag 780 

ctgctgcgaa cccggagaat cgagatgtgc tggaaagctt ctagcgagcg gctaaattag 84 0 

catgaaaggc tatgagaaat tctggagacg gcttgttgaa tcatggcgtt ccattcttcg 900 

acaagcaaag cgttccgtcg cagtagcagg cactcattcc cgaaaaaact cggagattcc 960 

taagtagcga tggaaccgga ataatataat aggcaataca ttgagttgcc tcgacggttg 1020 

caatgcaggg gtactgagct tggacataac tgttccgtac cccacctctt ctcaaccttt 1080 

ggcgtttccc tgattcagcg tacccgtaca agtcgtaatc actattaacc cagactgacc 1140 

ggacgtgttt tgcccttcat ttggagaaat aatgtcattg cgatgtgtaa tttgcctgct 1200 

tgaccgactg gggctgttcg aagcccgaat gtaggattgt tatccgaact ctgctcgtag 1260 

aggcatgttg tgaatctgtg tcgggcagga cacgcctcga aggttcacgg caagggaaac 1320 
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caccgatagc 
caaaccaatg 
taattgtaca 
aagcaacggc 
gatcccccaa 
gtctgactcg 
acattcaagg 
cgatacgacg 
ggcactgaac 
ggctttgggt 
tgctgccttt 
ggtttcgaat 
aacgaaatga 
gcctccctca 
catcttttga 



agtgtctagt 
gctaaaagta 
atcaagtggc 
aaagcccact 
ttgggtcgct 
gagcgttttg 
agtatttagc 
aatactgcat 
aggcaaaaga 
gtacatgttt 
accaagcagc 
agaaagagaa 
gctagtaggc 
tgctctcccc 
ggcacagaaa 



agcaacctgt 
cataagttaa 
taaacgtacc 
tcccacgttt 
tgtttgttcc 
catacaacca 
cagggatgct 
agtcacttct 
ttgagttgaa 
gtgctccggg 
tgagggtatg 
gcttagccaa 
aaagtcagcg 
atctactcat 
cccaatagtc 



aaagccgcaa 
tgcctaaaga 
gtaatttgcc 
gtttcttcac 
ggtgaagcga 
agggcagtga 
tgagtgtatc 
gatgaagtgg 
actgcctaag 
caaatgcaaa 
tgataggcaa 
gaacaatagc 
aacgtgtata 
caactcagat 
aaccgcggac 



tgcagcatca 
agtcatatac 
aacgcgtcgt 
tcagtccaat 
aagaagacag 
tggaagacag 
gtgtaaggag 
tccatattga 
atctcgggcc 
gtgtggtagg 
atgttcaggg 
cgataaagat 
tataaaggtt 
cctccaggag 
tgcgcatcat 



ctggaaaata 
cagcggctaa 
ggggttgcag 
ctcagctggt 
aggtaagaat 
tgaaatgttg 
gtttgtctgc 
aatgtaagtc 
ctcgggcttc 
atcgaca^ac 
gccactgcat 
agcctcatta 
cgaggtccgt 
acttgtacac 

g 



1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2211 



<210> 6 
<211> 1588 
<212> DNA 

<213> Trx choderma reesel 



<220> 
<223> T. 



reesei egll cDNA 



<400> 6 

cccccctatc 

accacggcca 

cccgaggtcc 

caggacacct 

tcgtgcaccg 

aagaactgct 

agcctcacca 

cggctgtatc 

ctgagcttcg 

tctcagatgg 

agcggctact 

agccaccagg 

gccttgaccc 

ccctatggca 

accttcacca 

agcatcaccc 

gacaccatct 

gccctgagca 

aactggctcg 

atcctggcca 

gggtctacta 

actacacgga 

cagtgcggtg 

tatagcaacg 

gacgggggca 

aagacatgct 

agcaaaaaaa 



ttagtccttc 
tcctggccat 
atcccaagtt 
cggtggtcct 
tcaacggcgg 
tcatcgaggg 
tgaaccagta 
tcctggactc 
acgtcgacct 
acgagaacgg 
gcgatgctca 
gcttctgctg 
ctcactcttg 
gcggctacaa 
tcatcaccca 
gcaagtacca 
cgtcctgccc 
gcggcatggt 
acagcggcaa 
acaaccccaa 
cgaactcgac 
ggagctcgac 
gcattgggta 
actactactc 
cgatagaatg 
atgttgtatc 
aaaaaaaaaa 



ttgttgtccc 
tgcccggctc 
gacaacctac 
tgactggaac 
cgtcaacacc 
cgtcgactac 
catgcccagc 
tgacggtgag 
ctctgctctg 
gggcgccaac 
gtgccccgtc 
caacgagatg 
cacggccacg 
aagctactac 
gttcaacacg 
gcaaaacggc 
gcccgcctca 
gctcgtgttc 
cgccggcccc 
cacgcacgtc 
tgcgcccccg 
gacttcgagc 
cagcgggtgc 
gcaatgcctt 
cgggcacgca 
tacattagca 
aaaaaaaa 



aaaatggcgc 
gtcgccgccc 
aagtgtacaa 
taccgctgga 
acgctctgcc 
gccgcctcgg 
agctctggcg 
tacgtgatgc 
ccgtgtggag 
cagtataaca 
cagacatgga 
gatatcctgg 
gcctgcgact 
ggccccggag 
gacaacggct 
gtcgacatcc 
gcctacggcg 
agcatttgga 
cgcagcagca 
gtcttctcca 
cccccgcctg 
agcccgagct 
aagacgtgca 
tagagcgttg 
gggagctcgt 
aatgacaaac 



cctcagttac 
agcaaccggg 
agtccggggg 
tgcacgacgc 
ctgacgaggc 
gcgtcacgac 
gctacagcag 
tgaagctcaa 
agaacggctc 
cggccggtgc 
ggaacggcac 
agggcaactc 
ctgccggttg 
ataccgttga 
cgccctcggg 
ccagcgccca 
gcctcgccac 
acgacaacag 
ccgagggcaa 
acatccgctg 
cgtccagcac 
gcacgcagac 
cgtcgggcac 
acttgcctct 
agacatnggg 
aaatgaaaaa 



actgccgttg 
taccagc^^c 
gtgcgtggcc 
aaactacaac 
gaccrgtggc 
ctcgggcagc 
cgtctctccc 
cggccaggag 
gctctacctg 
caactacggg 
cctcaacact 
gagggcgaat 
cggcttcaac 
cacctccaag 
caaccttgtg 
gcccggcggc 
catgggcaag 
ccagtacatg 
cccatccaac 
gggagac . :t 
gacgttttcg 
tcactggggg 
tacgtgccag 
ggtctgtcca 
cttaatatat 
gaacttatca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1588 
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<210> 7 
<211> 745 
<212> DNA 

<2I3> Trichoderma reesei 



<220> 

<221> terminator 
<222> (1) . . (745) 

<22 3> T . reesei cbhl terminator 



<400> 7 

ggacctaccc 

tgcgccagcg 

tccgtgcgaa 

ggagctacat 

atacggtcaa 

ttggcaaatt 

taacggaata 

cccgtagaat 

caatgttgat 

cgaacgcggt 

gcaacagtgg 

tggaaagcac 

acgatgacaa 



agtctcacta 
gcacaacttg 
agcctgacgc 
ggccccgggt 
ctcatctttc 
gtggctttcg 
gaagaaagag 
cgccgctctt 
at tgttccgc 
agtggctgct 
aaattagtgg 
tgt tggagac 
cgtagccgag 



cggccagtgc 
ccaggtcctg 
accggtagat 
gatttatttt 
actggagatg 
aaaacacaaa 
gaaattaaaa 
cgtgtatccc 
cagtatggct 
gccaattggt 
cgcaataatt 
caacttgtcc 
gaccc 



ggcggtattg 
aacccttact 
tcttggtgag 
ttttgtatct 
cggcctgctt 
acgattcctt 
aaaaaaaaaa 
agtaccacgt 
ccacccccat 
aatgaccata 
gagaacacag 
gttgcgaggc 



gctacagcgg 
actctcagtg 
cccgtatcat 
acttctgacc 
ggtattgcga 
agtagccatg 
aacaaacatc 
caaaggtatt 
ctccgcgaat 
gggagacaaa 
tgagaccata 
caacttgcat 



ccccacggtc 
cctgtaaagc 
gacggcggcg 
cttttcaaat 
tgttgtcagc 
cattttaaga 
ccgttcataa 
catgatcgtt 
ctcctcttct 
cagcataata 
gctggcggcc 
tgctgtcaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

745 



<210> 8 
<2ll> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 8 

taaccgcggt 10 



<210> 9 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 9 

ctagaccgcg gttaat 16 



<210> 10 
<211> 1232 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> promoter 
<222> (1) . . (1232) 



WO 00/58342 



PCTYFIOO/00249 



<22 3> T. reesei gpdl promotor 



<400> 10 

gtcgacacga 

atgtgacaat 

caccaaggtg 

gttagtgaag 

cagtacgtca 

gattgcctca 

agattgagag 

ccccagccgg 

tgcttggcaa 

gcagttgcta 

taggatggcg 

aagttgatga 

gatgccgcat 

ccactatctc 

gtgttaatct 

cttcctcctc 

agcctcgcaa 

gcccgacgat 

gccccgccag 

tccctctcgt 

cacgcagctc 



tatacaggcg 

attgactgcg 

catgcatcat 

gcaccgtgat 

agtaacgtca 

atcacatagt 

ctaccttaca 

cgcgattgtg 

tcgtgctcag 

aggtacctag 

gctggcgggg 

taccgttttg 

caccgttgtg 

aggtaaccag 

ttccacctcc 

ttcttctcca 

atcctcagtt 

gcggcttctg 

gactctcctg 

ttcttcctaa 

aattcgcaga 



cggctgatga 

aggaaccatc 

aaggattctc 

ggcaagatag 

aagccgccct 

acctacctat 

tcaatatggc 

tgtactaggt 

ctgttaggtt 

ggagggattc 

ccgaagctgg 

ccagattaga 

tcagaaacga 

gtactaccag 

ctcctcaatc 

taaccattcc 

tgtatgtacg 

ttcgcctgcc 

cgtcaccaat 

caaacaacca 

tacaaatcta 



taatgatgat 

aggtgtgtat 

ctcagctcac 

aaccactatt 

cccgtaacct 

gcattatggg 

cagcacctct 

aggctcgtac 

gtacttgtat 

aacgagcccc 

gaactcgcca 

tgcgagaagc 

ccaagctaag 

ctaccctacc 

ttcttttccc 

taacaacatc 

tacgtactac 

cctcctctca 

ttttttccct 

ccaccaaaat 

ga 



cgagcatgac 

ggatggaacc 

caacaacgaa 

gcatctgcgc 

cgcccgttgt 

cggcctcaac 

tcggcgatac 

tataccagca 

ggtacttgta 

gcttccaatg 

acagtcatat 

agcatgaatg 

caactaaggt 

tgccgtgcct 

tcctctcctc 

gacattctct 

aatcatcacc 

ctcgtgccct 

atttacccct 

ctctttggaa 



ttgatgctgt 

attctgtaac 

cgatggccat 

ttcccacgca 

tgctccc jc 

ccaccccccc 

atactcgcca 

ggagaggtgc 

aggtggtcat 

tccatctgga 

gtaatagctc 

tcgctcatcc 

accttaccgt 

acctgcttta 

tttttttttt 

cctaatcacc 

acgatcgtcc 

tgacgagcta 

cctccctctc 

gctcacgact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1232 



<210> 11 

<211> 1129 

<212> DNA 

<213> Trlchoderma reesei 
<220> 

<221> terminator 

<222> (1) . . (1129) 

<223> T. reesei gpdl terminator 



<400> 11 

ggatcccgag 

atgacgaata 

ccttgactgc 

gcgtgcgagc 

cttggcttac 

atcgcttcta 

aggtcgattc 

agaaggaatg 

ccaaaagcac 

agaatgatct 

aacaaaaaaa 

tgaggtgacc 

ccgcgcctcg 

gaagcggggt 

atcaaacagc 

accgaggaac 

gttcctcaat 

aggagagaga 

gaatcagaag 



cattgtctat 
gagccaatct 
ccattcaaac 
ccgtttgtac 
gatcatgttt 
tgattaggta 
caatcatgat 
gtaggacctc 
ctacctatct 
caagaggccg 
gaaagatgag 
tgtcagggtt 
cctcttctct 
tatcgacgcc 
ccccgagtga 
aataaagcga 
caatcccaac 
tatttccact 
tgacaacaac 



gaatgcaaac 

gctccacttg 

aattgtaaag 

gttattttga 

gcgatcttcg 

ctcagacaac 

ctggactggg 

tcaggcgtcc 

acctagagta 

tgaacgcaga 

gactagaaga 

taccgcaaac 

cctgtccttt 

accaaagat c 

tcagatccgt 

gtaaataaca 

ccccctaaaa 

accgctcacc 

gggttgagca 



aaaaatagta 

actatcttgt 

gaatatagct 

gaaagcgttg 

gtaagaatac 

acatgggaaa 

gtattccatc 

accatctgtg 

actgcacgag 

aacacactcc 

tggagtattt 

cgtcggtggt 

catagaatcc 

ttgtcttggt 

aaaagaagaa 

tcaaaataag 

gcccttcccc 

accaagtgag 

catgggatat 



aataaatagt 

gactgtatcg 

acaagttatg 

ccatcacatg 

acagagtaac 

caagataacc 

taagccatag 

ctgcaaatcc 

aaaagaaaag 

tcccaacttt 

ccttcttaga 

tctatccaat 

cgtctccttg 

gacttatcaa 

gaagagtacg 

agtctcgttg 

ccatggtata 

gcttgccgag 

cggcgcgcc 



aattctggcc 

tatgtcgaac 

tctcacgttt 

ctcacagtca 

gattatctcc 

atcgcatgca 

taccctc^cig 

aagaaacccc 

gagcagaaga 

tcaagttttg 

gagctctcgg 

taatcaagtc 

ttgcttgatc 

tcctttggtg 

atttaaccag 

aaaattactt 

tcccggcagt 

agaagaggat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1129 
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<210> 12 
<211> 5733 
<212> DNA 

<213> Aspergillus nldulans 



<220> 

<223> (1-5733) Sequence of plasmid pAN52-l 



<220> 

<221> promoter 
<222> (1) . . <2129) 

<223> A . nidulans gpdA promoter 



<220> 

<221> gene 

<222> (2130) . . (2304) 

<22 3> A. nidulans gpdA gene 

<220> 

<221> terminator 

<222> (2305) . . (3071) 

<22 3> A . nidulans trpC terminator 



<220> 

<221> misc_feature 

<222> (3072) . . (5726) 

<223> pUC18 from Sail to EcoRI 



<400> 12 

caattccctt 

cgtttatatc 

cttgaagtaa 

agcggcgagg 

cgccagctca 

ttcaagtaca 

tggttaagat 

ctgccaacag 
ccgagaagaa 

c 99 a 9 c 99 t 9 
ggaggtagca 
gagggtcctt 
ccttttattc 
ggtt ttcgaa 
acgtggcact 
gttgctagca 
catgatctct 
aagctgtaag 
gtaccattta 
aaagtcgtat 
ataaacttta 
aaatctgtcc 
gttgacctag 
cctccaccat 
ctttctggca 
gccagacagc 
ccctccgccc 
atcggagaat 
ggggtgtata 



gtatctctac 
ttgcatcgtc 
tctctgcaga 
agcctcgtcc 
ttaagagcca 
ccaacgctct 
ccacgtcttc 
ctttctcagc 
ctggaggggt 
tcaagaggat 
tagct tcatt 
tggctttcaa 
accaatgagc 
atacatccgg 
attgatcatc 
gttccaggtg 
agttagctca 
gatttcggca 
at tctatt tg 
agctaccagt 
agcctaaact 
agatcatggt 
ctgattctgg 
ttgtagaaaa 
tgcggagaga 
tctggcggct 
cgaagtggaa 
atggagcttc 
gccgtcggcg 



acacaggctc 
ccaaagctat 
tctttcgaca 
tgtcacaact 
gttcatgggc 
tctgatccag 
gggaagccag 
cagggccagc 
ggtgtcaagg 
at tcttcgac 
tggatttgct 
tattctcaag 
attggaatga 
atgtcgaagg 
cgatagctct 
gaatgttatg 
ccacaaaagt 
cggctacgga 
tgtttgatcg 
gaggaagtgg 
atacagaata 
tgaccggtgc 
agtgacccag 
atgtgacgaa 
cggacggacg 
ctgaggtgca 
aggctggtgt 
atcgaatcac 
aaatagcatg 



aaatcaataa 
tggcgggata 
ctgaaatacg 
accaacatgg 
gttggcatga 
tcgatcatcc 
cgactggtga 
ccaagaccga 
aggagtaagc 
tctgtattat 
t tccaggctg 
tatctcgagt 
acatgaatct 
cttggggcac 
gcaaagggcg 
atgagcattg 
cagacggcgt 
agacggagaa 
agacctaata 
actcaaatcg 
agataggtgg 
ctggatcttc 
agggtcatga 
ctcgtgagct 
cagagagaag 
gtggatgatt 
gcccctcgtt 
cggcagtaag 
ccattaacct 



gaagaacggt 
ttctgtttgc 
tcgagcctgc 
agtacgataa 
tggccgtcat 
gctgaaggcg 
cctccagcgt 
caaggcct cc 
tccttattga 
agataagatg 
agactctagc 
ttgaacttat 
gaggactgca 
ctgcgttggt 
ttgcacaatg 
tattaaatca 
aaccaaaagt 
gccaccttca 
cagcccctac 
acttcagcaa 
agagcttat a 
ctatagaatc 
cttgagccta 
ctgtacagtg 
ggctgagtaa 
attaatccgg 
gaccaagaat 
cgaaggagaa 
aggtacagaa 



tcgtcttttt 


60 


agttggctga 


120 


tccgcttgga 


180 


gggccagttc 


240 


gcatctgtac 


300 


ctttcgaatc 


360 


ccctttaagg 


420 


ctccagaacg 


480 


agtcggagga 


54 0 


atgaggaatt 


600 


ttggagcata 


660 


tccctgtgaa 


720 


atcgccatga 


780 


tgaatttaga 


840 


caagtcaaac 


900 


ggagatatag 


960 


cacacaacac 


1020 


gtggactcga 


1080 


aacgaccatc 


1140 


catctcctgg 


1200 


ccgagctccc 


1260 


atccttattc 


1320 


aaatccgccg 


1380 


accggtgact 


1440 


taagccactg 


1500 


gaccggccgc 


1560 


ctattgcatc 


1620 


tgtgaagcca 


1680 


gtccaattgc 


1740 
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ttccgatctg 
ccggcgcgta 
ctctcctgct 
cagaccggga 
gtccctcagt 
tgacaaggtc 
ctgctctttt 
acctttattt 
attcctttga 
cccgcctgag 
gacgaatctg 
gctcaggatc 
tttaatagct 
agctcatctg 
cgaagagaag 
caacggtcgt 
tatttgtctc 
aaaattccgt 
tcaagcctac 
atggtataca 
ccggccgaaa 
ttgtttagag 
catctccact 
gactgggaaa 
agctggcgta 
aatggcgaat 
cgcatatggt 
cacccgccaa 
agacaagctg 
aaacgcgcga 
ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaaccg 
gcggataaag 
gataaatctg 
ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggnt 
ctacagcgtg 
ccggtaagcg 



gtaaaagatt 
agctccctaa 
ttgcccggtg 
acacaagctg 
ccccggtagg 
gttgcgtcag 
ctttcctctt 
cccctaagna 
acctttcagt 
cagacatcac 
gatataagat 
tcgataagat 
ccatgtcaac 
caatgcatta 
aatagcttag 
caagagacct 
taattgtact 
caccagcncc 
aggacacaca 
atagtaacca 
cttttttaca 
gtaatccttc 
cgacctgcag 
accctggcgti 
atagcgaaga 
ggcgcctgat 
gcactctcag 
cacccgctga 
tgaccgtctc 
gacgaaaggg 
cttagacgtc 
tctaaataca 
aatattgaaa 
ttgcggcatt 
ctgaagatca 
tccttgagag 
tatgtggcgc 
actattctca 
gcatgacagt 
acttacttct 

gggatcatgt 

acgagcgtga 
gcgaactact 
tcgcaggacc 
gagccggtga 
cccgtatcgc 
agaccgctga 
catatatact 
tcctttttga 
cagaccccgt 
gctgcttgca 
taccaactct 
ttctagtgta 
tcgctctgct 
ggttggacnc 
cgtgcacaca 
agctatgaga 
gcagggtcgg 



cacgagatag 
ttggcccatc 
tatgaaaccg 
gcagtcgacc 
cagctttgcc 
tccaacattt 
tcttttccca 
agcactttgc 
tcgagctttc 
canggatcca 
cgtcggtgtc 
acgcccattt 
aagaataaaa 
atgcattgac 
cagagctatt 
acgagactga 
ttgacatgct 
tgggttcgca 
ttcatcgtag 
tgcatggttg 
actctcctat 
ttnctagaag 
gcatgcaagc 
tacccaactt 
ggcccgcacc 
gcggtatttt 
tacaatctgc 
cgcgccctga 
cgggagctgc 
cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 
agttatctac 
gacaggtgcc 
ctagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 



taccttctcc 
cggcatctgt 
gaaaggccgc 
catccggtgc 
ccgtctgtcc 
gtcgccatat 
tcttcagtat 
tacatccata 
ccacttcacc 
cttaacgtxa 
gatgtcagct 
gtccaagcag 
cgcgttttcg 
tgcaacctag 
ttcattttcg 
ggaatccgct 
ccccttcttt 
aagataattg 
gtataaacct 
cctagtgaat 
gagtcgttta 
tcctcgtgta 
ttggcactgg 
aatcgccttg 
gatcgccctt 
ctccttacgc 
tctgatgccg 
cgggct tgtc 
atgtgtcaga 
cgcctatttt 
tttcggggaa 
t atccgctca 
atgagtattc 
gtttttgctc 
cgagtgggtt 
gaagaacgt t 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 
acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 



gaagtaggta 
agggcgtcca 
tcaggagctg 
tctgcactcg 
gcccggtgtg 
tttcctgctc 
attcatcttc 
ctccatcctt 
gcagcttgac 
ctgaaatcat 
ccggagttga 
caaagagtgc 
ggtttacctc 
taacgccttn 
ggagacgaga 
cttggctcca 
actctgatag 
catgtttcct 
cgaaatcant 
gctccgtaac 
cccagaatgc 
ctgcgtaagc 
ccgtcgtttt 
cagcacatcc 
cccaacagtt 
atctgtgcgg 
catagttaag 
tgctcccggc 
ggttttcacc 
tataggt taa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttccaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggcnggctg 
tcgcagcact 
gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 



gagcgagtac 
aatatcgtgc 
gccagcggcg 
acctgccgag 
tcggcggggt 
tccccaccag 
ccatccaaga 
cccatccctt 
taacagccac 
caaacagcct 
gacaaatggt 
cttctagtga 
ttccagatac 
caggctccgg 
tcaagcagat 
cgcgactata 
cttgactatg 
ccttgaactc 
tcctactaag 
acccaatacg 
acaggtacac 
gcccactcca 
acaacgtcgt 
ccctttcgcc 
gcgcagcctg 
tatttcacac 
ccagccccga 
atccgcttac 
gtcatcaccg 
tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gtcgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 
tatggatgaa 
accgtcagac 
taaaaggatc 
gttttcgttc 
tttttttctg 
ttgtttgccg 
gcagatacca 
tgtagcaccg 
cgataagtcg 
gtcgggctga 
actgagatac 
ggacaggtat 
gggaaacgcc 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
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tggtatcttt atagtcctgt cgggtttcgc cacctctgac tngagcgtcg atttttgtga 5280 
tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc 534 0 
ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg 5400 
gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag 54 6 0 
cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa cacgcaaacc gcctctcccc 5 52 0 
gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc 5580 
agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac 5640 
tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga 5700 
aacagctatg accatgatta cgaattgcgg ccg 5733 



<210> 13 
<211> 39 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 13 

gtcaaccgcg gactgcgcat catgaagttc ttcgccatc 3 9 



<210> 14 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 5' primer 
<400> 14 

tctagcaagc ttggctctag ttctggaacc gcaccaggcg gcagcaacgg caacggcaat 60 
gtttgc 66 



<210> 15 

<211> 30 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 15 

tcgtacaagc tttcaagcac cgacggcggt 3 0 



<210> 16 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR 5' primer 



<400> 16 

tctagctcta gaagcaacgg caacggcaat gtt 



33 
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<210> 17 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR 3' primer 
<400> 17 

tgctagtcga cctgctagca gcaccgacgg cggtctg 37 



<210> 18 
<211> 4614 
<212> DNA 

<213> Saccharomyces cerevlsiae 



<220> 
<223> S. 



cerevlslae FLOl coding sequence 



<400> 18 

atgacaatgc 

gtggcctcag 

ataaattttt 

gcttatggat 

attgattata 

tcccatggaa 

tactggagta 

acaggttatt 

gactctgcaa 

ccgccgatca 

ccacctaata 

gtttactcga 

accactgtaa 

tctaactgta 

gaaccatgga 

aacggcgttc 

atcataacta 

acagtcactg 

acaacagcca 

tctaccgaat 

gtcatcagaa 

acttttacct 

gatgaaacca 

cagccatgga 

aacggtttgc 

gccatgacta 

accgtcaccg 

acaacagcca 

tccactgaaa 

gtcatcagaa 

acttttacct 

gatgagacca 

cagccatgga 

aacggtttgc 

gccatgacta 

accgtcaccg 



ctcatcgcta 
gagccacaga 
accagtat tc 
atgcctcaaa 
atattccctg 
actggggatg 
ctgatttatt 
ttttaccacc 
ttctatcagt 
catcaacgaa 
tcgaaggaac 
acgctgtttc 
gtgatgactt 
ctgtccctga 
ccggtacttt 
caactgacga 
caactgagcc 
gcaccaatgg 
ctactgccat 
tgaccacagt 
caccaacaac 
ctacatccac 
tcattgtcat 
acgacacttt 
caactgatga 
caactcagcc 
gtaccaatgg 
ctactgccat 
tgaccaccgt 
caccaacaac 
ctacttctac 
tcattgtcat 
acgacacttt 
caactgatga 
caactcagcc 
gtaccaacgg 



tatgtttttg 
ggcgtgctta 
attgaaagat 
aaccaaacta 
tgttagttca 
caaaggaatg 
tggtttctat 
acagacgggt 
aggtggtgca 
ctttaccatt 
cgtctatatg 
ttggggtaca 
cgaagggtac 
cccttcaaat 
cacttccaca 
aaccgtcatt 
atggaacagc 
tgtacgaact 
aactacaact 
caccggtacc 
agccactact 
tgaaatgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacgac 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
cgaattgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacgac 
tttgccaact 



gcagtcttta 
ccagcaggcc 
tcctccacat 
ggttctgtcg 
tcaggcacat 
ggtgcttgtt 
actaccccaa 
tcttacacat 
accgcgttca 
gacggtatca 
tacgctggct 
cttccaatta 
gtctattcct 
tatgctgtca 
tctactgaaa 
gtcatcagaa 
acttttacct 
gacgaaacca 
gagccatgga 
aatggtttgc 
gccatgacta 
accgtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
acttttacct 
gatgagacca 
cagccatgga 
aacggtttgc 
gccataacta 
acagtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
acttttacct 
gatgagacca 



cacttctggc 
agaggaaaag 
attcgaatgc 
gaggacaaac 
ttccttgtcc 
ctaatagtca 
caaacgtaac 
tcaagtttgc 
actgttgtgc 
agccatgggg 
actattatcc 
gtgtgacact 
ttgacgatga 
gtaccactac 
cgaccaccgt 
ctccaacaac 
ctacttctac 
tcattgtaat 
acagcacttt 
caactgatga 
caactcagcc 
gtaccaacgg 
ctactgctat 
tgaccaccgt 
caccaacaac 
ctacatccac 
tcattgtcat 
acgacacttt 
caactgatga 
caactgagcc 
gtaccaatgg 
ctactgccat 
tgaccaccgt 
caccaacaac 
ctacatccac 
tcattgtcat 



actaactagt 
tgggatgaat 
agcatatatg 
tgatatctcg 
tcaagaagat 
aggaattgca 
cctagaaatg 
tacagttgac 
tcaacagcaa 
tggaagtttg 
aatgaaggtt 
tccagatggt 
cctaagtcaa 
aactacaacg 
caccggtacc 
tgctagcacc 
cgaattgacc 
cagaacacca 
tacctctact 
gaccatcatt 
atggaacgac 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
tgaaatgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacagc 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
tgaaatgacc 
cagaacacca 
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240 

300 

360 
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480 
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600 
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720 

780 

840 

900 

96G 

1020 

1080 

1140 

1200 

1260 

132 0 

138C 

144C 

1500 

1560 

1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 

2160 



